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deploying Microsoft's SQL Server 2005 BI platform—how long it took to 
get the data warehouse up and running and the challenges faced during 
implementation. 
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Demystifying SQL Server 


Virtualization 


ecause SQL Server can be I/O intensive, it 

hasn't been at the top of the list of server 
types you’d want to virtualize. However, with the re- 
cent release of Microsoft’s Hyper-V and VMware’s 
new ESX Server 3.5, SQL Server virtualization is 
becoming increasingly interesting. Performance 
for many deployments is proving to be perfectly 
acceptable, especially in light of the benefits 
virtualization can offer—such as increased hard- 
ware utilization. One remaining concern about 
SQL Server virtualization is that confusion has 
grown around virtualization itself. I want to clear 
up some of that confusion. 

First, some people think that the expression 
“virtualizing SQL Server” implies that you have to 
do something to SQL Server to make it run in a 
virtual environment. That’s not the case. From a 
management and implementation standpoint, 
there’s absolutely no difference between SQL 
Server running in a virtual machine (VM) or SQL 
Server running on a physical box. To learn more 
about virtualizing SQL Server, see “Virtualizing 
SQL Server,” May 2007, InstantDoc ID 95349. 

Next, confusion arises around the difference 
between named instances (a SQL Server feature 
that is not widely understood) and virtualization. 
Named instances allow you to run multiple copies 
(or instances) of the SQL Server database engine 
simultaneously on the same server. You typically 
find named instances in hosting environments. 
Named instances share the same server and OS 
and provide better performance than virtualized 
servers because they run directly on the physical 
server. SQL Server virtualization enables differ- 
ent virtualized SQL Server instances to run with 
different hardware and OS configurations and 
provides more flexibility for recovery because the 
VM containing SQL Server is insulated from the 
underlying hardware platform. However, because 
SQL Server in a VM runs on virtual hardware, the 
performance doesn’t match the performance that 
named instances can achieve running on physical 
hardware. 

One source of confusion is that there are three 
different types of virtualization, and not all vir- 
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tualization platforms are created equal. Hosted 
virtualization products, such as Microsoft Virtual 
Server 2005 and VMware Server, work by running 
virtualization software on top of a host OS. A new 
generation of hypervisor-based virtualization plat- 
forms provides a small special-purpose layer of soft- 
ware, called the hypervisor, that runs directly on the 
hardware, isolating VMs from the underlying OS. 
By running directly on the hardware, a hypervisor 
provides better performance, reliability, and security 
than hosted virtualization solutions. Examples of 
hypervisor-based virtualization are VMware ESX, 
Microsoft Hyper-V, and Citrix XENServer. 

A third, less well-known type of virtualization 
product uses OS-level virtualization. With OS-level 
virtualization, it’s the OS that’s virtualized instead 
of the hardware level. OS-level virtualization offers 
performance and hard- 
ware utilization benefits 
just as hypervisor-based 
virtualization does, but 
its ties to the underlying 
OS makes it less flex- 
ible than both hosted 
and _ hypervisor-based 
virtualization products. 
OS-level virtualization 
has the advantage of 
low overhead, but it also 
requires each virtual OS 
(or container) to be the same and the hardware con- 
figuration for each container to be the same. Paral- 
lels Virtuozzo Containers is an example of an OS- 
level virtualization solution. 

1 hope Гуе addressed some of the common 
questions and misconceptions that surround 
SQL Server and virtualization. Once you under- 
stand the available virtualization options for SQL 
Server, you can start to examine where they might 
fit into your environment. If you have questions 
about virtualizing SQL Server or have a story to 
tell about your company's move to SQL Server 
virtualization, send us an email at letters@sqlmag 
.com or visit our SQL Server forums. [SQL | 
== InstantDoc 1D 99841 


equal.” 


Michael Otey 


mikeo@ sqlmag.com) is technical director 
for Windows IT Pro and SQL Server 


Magazine and author of Microsoft SQL Server 
2008 New Features (Osborne/McGraw-Hill). 


*One source of confusion 
is that there are different 
types of virtualization, 
and not all virtualization 
platforms are created 
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ТОР 10 TIPS 
ብ PELO መመ ለመ “የ የሮ ረር 
Implementing г BI Solution 


There are many more than 10 tips to consider when implementing a BI solution but 

| chose these 10 based on my many years of experience working with business and 
technology professionals with a wide range of BI experience. Some of these tips are 
very specific, referring to a single property or technique within Analysis Services. 
Others are more general, providing guidance on approach or strategic direction. 
Used together these techniques will help you design and implement a BI Solution 
that doesn’t just work but works well. 


Learn MDX. МОХ is the fundamental language of Analysis Services—it's 

the SQL of the multi-dimensional environment. From the expressions that 

calculate the most valuable business metrics to KPls, dynamic security, and 

report queries your ability to effectively use MDX will dictate how much of 
Analysis Service’s power you are able to unleash. Yes, MDX has a steeper learning 
curve than SQL but stick with it; once MDX starts to make sense a whole new world of 
capability will open for you. 


Abstraction, abstraction, abstraction. Bl is a very dynamic environment 
with interaction and dependencies across and between ETL, Database, 
Reporting, and Business Analysis teams. In a perfect world all related 
initiatives would flow seamlessly into one another; but we rarely live in a 
perfect world. Use named queries and named calculations in the Analysis Services 
Data Source View to model the future state of the data mart. This will allow you to 
continue cube and, subsequently, report design while the database team completes 
its work. Consider living by this rule: no application, including Analysis Services, 
Will access data mart tables directly. Create views for all Data Mart tables—even if 
the views are of the select * from variety—as a way of protecting your solution from 
schema changes that will inevitably come in the future. 


Use non-empty behavior. An often overlooked capability of Analysis 4 


Services 2005 is the Non-Empty Behavior for calculated members. ፇ “ሥሥ 
L : 


Configuring this behavior properly can provide huge improvements in query 
performance by eliminating unnecessary evaluations of calculated members. ሠ y 4 ፡ ምሪ 


One simple technique is to use the Row Count measure automatically created with 

a new Measure Group in the Non-Empty Behavior property of related calculated ቃ ሪ 
members. If the row count measure is empty, there аге по fact rows and the expression 

need not be evaluated. ሠሪ 


Reports are not an end-point. Though some reports always will be printed ቀ 
or rendered into static forms the greater power of contemporary reporting is e. L 
to think of reports not as end-points but as touch-points on-an analytic path. + Ф ም. 
Interactive reports that allow users to access other reports ог applications— F 

in context of their current selection—create a rich, cohesive information e 


environment. 


Provide both drill-down navigation and cross-axis analysis. You can 
measure the success or failure of a BI solution by user acceptance and 
usage. Given this, it’s vital that the solution provide the navigation and 
analysis capabilities desired by all user communities. In general, navigation 
and analysis can be defined as drill-down or cross-axis. In Analysis Services this 
means creating two sets of attributes from a single dimensional column: one for use 

in a User Hierarchy (drill-down) and one for use as an Attribute Hierarchy (cross-axis). 
Setting the User Hierarchy-based attributes as non-visible is a nice touch that simplifies 
the user interface. 


Real-time BI. When considering whether real time is a legitimate requirement 
for your BI solution remember that the definition of real time is a function of all 
the time it takes from transaction inception in an operational system, through 
ETL into the DM, and finally into a cube. If transactions are received once 

at the end of the business day there's little benefit in enabling Proactive Caching or a 
ROLAP BI design for real-time Bl. When real time is possible it can be highly valuable 
but real time between the data mart and the cube is often the least important step in 
the larger solution. 


©? 


Organic data marts. When scoping and designing your data mart remember 
that it’s an evolving, organic environment. Assume that the data mart schema 
will change in response to changing business needs. This ability to evolve is, 

perhaps, the most important capability of the data mart and is at the heart of 

design concepts such as abstraction and limited scope, iterative deployment cycles. 


~, 


Key attributes and slowly changing dimensions. Often, users require a 
current and historical view of slowly changing dimensions. This can be easily 
supported in Analysis Services through the use of Key Attributes referencing 
surrogate keys, and meaningful name column references, in conjunction with 
а second attribute using the same name column reference as both its key and name. 
Creating two user Hierarchies—Current with the non-Key attribute as the leaf and 
Historical with the Key attribute as its leaf and no reference to the non-Key attribute— 
allows users to select whether they'd like to see dimension members in their historically 
accurate relationships or as a single member in its current relationship. 


oO 


The Exists() function. Analysis Services models can be very large and MDX 
expressions that involve set manipulation can be painfully slow. Reducing 

the size of the sets used in these expressions can significantly improve 
query performance. Creating attribute relationships as, essentially, member 
properties (think, for example, of Title as an attribute directly related to Employee 
Name) and limiting the set of employees using the Exists() function can often reduce a 
set of several thousand to a set of a few dozen members. 


Don't lose focus on the business purpose. Building a Bl solution 
from the “data out" (taking all available data into the warehouse 
assuming that it will then be possible to answer any question) is an 
expensive mistake. This technique is front loaded with implementation 
cost and inevitably produces a solution that fails to deliver the anticipated business 
value. Starting with planning and continuing through design and implementation, each 
investment in a BI solution should have a clear relationship to a specific business need 
with a clearly defined method of delivery (e.g., an interactive report, a browser- 
based dashboard, a Data Mining query capability to segment new customers). 


Get Rid of Gaps 


in Numerical Sequences 


n common for a table to contain an integer column 
thats used as the table's primary key or part of 
the primary key. An integer column can be a simple 
sequence of numbers (e.g., 1, 2, 3, ...) ora more com- 
plex sequence that includes numbers representing spe- 
cific information such as a year (e.g., 2008001, 2008002, 
2008003, ...). One of my clients uses an integer column 
that falls into the latter category. This company main- 


tains a database 
ORE on the WEB 


of customers 
Download the listings at who are iden- 
InstantDoc ID 99797. tified by a 
numeric ID. 
Each customer ID reflects the year in which the cus- 
tomer record was added to the database, the numeric 
ID of the office that added the customer record, and 
an integer. Each year, all the offices are assigned a 
non-overlapping range of integers to use in the cus- 
tomer IDs. 

Believing that a gapless sequence of customer IDs 
helps instill a high level of confidence in the customer 
list, the company’s CEO requested that the numbers 
be maintained so that there are no gaps. In this case, 
a gap is said to exist when the difference between two 
consecutive integers in a sorted sequence is more than 
one. For example, the sequence 1, 2, 4, 5 has a gap 
between 2 and 4. 

Although companies might want integer sequences 
in database applications to be gapless, gaps occur fre- 
quently. Gaps can occur because of the assignment 
process. An automated process might produce gaps 
because of the logic used or an untrained user gener- 
ates more numbers than needed. Manual assignment 
systems are notorious for generating gaps due to user 
errors. Gaps can also occur because of inadequate 
table maintenance. For example, a number might be 
assigned to a record in a table, but later the record is 
deleted. 

To meet the CEO’s request, I began working on a 
system to check for and eliminate gaps. I first created 
a script, GenerateGaps.sql, that generates a tempo- 
rary table (##TempTbl) containing around 1,000 
integers with gaps. GenerateGaps.sql begins by gen- 
erating a simple table that has as an integer identity 
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column and a tiny integer column. Next, the script 
adds one record to the table, then adds more records 
through recursion. In the first recursion, it copies the 
existing record and adds that record to the table. In 
the second recursion, the script copies the two existing 
records and adds them to the table. The recursion 
occurs 10 times, so the table is populated with 1,025 
records (1 through 1025). The script then deletes three 
records—the records whose identity column values 
are 250, 500, and 750. Finally, the script creates and 
populates a table (##TempRange) with the two range 
values of 300 and 600. (The purpose of the range 
values will be discussed shortly.) 

In GenerateGaps.sql, you can adjust how many 
records are generated if more or fewer records are 
desired. The number of records generated is the sum 
of a progression of powers of 2 (2! + 2? + 2? + 2" and 
so on), so 20 recursions produce more than a million 
records. You can also adjust how many records are 
deleted and change the range values in the ##Temp- 
Range table. 

With the integers generated, my next task was to 
write a script that would search a specified range of 
integers in a sequence to find the first gap (1.е., the 
lowest missing integer) or the next available integer. 
FindGap.sql in Listing 1, page 10, is the result. 
Here's how this script works. In the code at callout 
A, FindGap.sql retrieves the range values from the 
##TempRange table, then generates a sequence 
of integers within that range. So, in this example, 
FindGap.sql creates 301 integers that start at 300 and 
end at 600. 

Next, FindGap.sql compares the newly created 
gapless sequence of integers with the same range of 
integers from the ##TempTbl table. As callout B 
shows, one of several actions will occur based on the 
result of this comparison. When a gap is found, the 
script returns the missing integer. When no gaps are 
found, the script uses the MAX function to obtain the 
maximum value in the range, then adds 1 to that value. 
The resulting value becomes the next available integer 
unless one of the following conditions is met: 

* If the resulting value is null, no integers from the 
##TempTbl sequence are in use, so the script 
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returns the first integer in the range. 

• If the resulting value is the last number in the 
range, the script returns the negative version of 
that number. 

• If the resulting value is greater than the highest 
value in the range, all the integers from the 
##TempTbl sequence are in use, so the script 
returns a 0. 


By varying the range values in GenerateGaps.sq], 
you can see how this part of FindGap.sql works. For 
example, GenerateGaps.sql provides the range values 
of 300 and 600, which means FindGap.sql will return 
a value of 500 because that's the first gap. If you 
change the range values in GenerateGaps.sql to 1500 
and 2000, FindGap.sql will return a value of 1500 
because no integers in the range have been used so 
far. If you change the range values in GenerateGaps 
.sql to 800 and 1025, FindGap.sql returns a value of 
-1025, which means the last integer in the range is the 
next available integer in the sequence. If you change 
the range values in GenerateGaps.sql to 800 and 1000, 


FindGap.sql returns a value of 0 because all the num- 
bers in the range have been used. 

As callout C shows, once determined, the next 
available integer is returned using the sp executesql 
stored procedure with an output parameter. You can 
send a message to a DBA based on the value returned. 
For example, if a 0 or a negative integer is returned, 
the message can inform the DBA that the ranges in 
the table need to be updated so that new numbers can 
be created. If a positive integer is returned, there is no 
need to send a message. 

You can download GenerateGaps.sql and 
FindGap.sql by going to www.sqlmag.com, entering 
99797 in the InstantDoc ID text box, clicking Go, 
then clicking the 99797.zip hotlink. These scripts 
work on SQL Server 2005 SP2 and SQL Server 2000 
SP4. I tested FindGap.sql’s performance on SQL 
Server 2005. With a range containing around 100,000 
integers, FindGap.sql found the first gap and the next 
available integer in about 1 second. When the range of 
integers was increased to around a million, FindGap 
„за! found the first gap and the next available integer 


LISTING 1: FindGap.sql 


USE tempdb 


DECLARE (85 пуагсһаг (4000) ,@newline nvarchar(2) 
DECLARE (cr nvarchar(2) 
SET @cr=nchar (19) 


-- Get the range from the ##TempRange table. 
DECLARE @п1 int,@n2 int 
SELECT @n1=Range1,@n2=Range2 FROM ##TempRange 


-- Create the integer sequence within that range. 
DECLARE (81541 varchar (1000) 
SET Q(iSql-'SELECT TheInteger FROM ##TempTb1'+@cr 
SET @iSql=@iSql+'WHERE TheInteger BETWEEN ' 
+CONVERT (магсћаг (12) ,@n1)+' AND ' 
+CONVERT (varchar (12) ,@n2)+@cr 
SET @iSql=@iSql+'ORDER ВУ TheInteger '+@cr 


-- Compare the newly created gapless sequence 
-- with the sequence in ##Тетртб1. 
SET 65='' 
SET @s=@s+'declare @i int'+@cr 
SET @s=@s+'declare @t table(X int IDENTITY('+CONVERT 
(МАКСНАК (12) ,@n1)+',1) Primary Key,TheInteger int) '+@cr 
SET @s=@s+'INSERT INTO @t'+@cr 
SET @s=@s+@iSql+@cr 
SET @s=@s+'SELECT @i=min(X) FROM Gt '+@cr 
SET @s=@s+'WHERE X<>TheInteger'+@cr 
-- No gaps are found. 
SET @s=@s+'SET @i=case WHEN Qi IS NULL THEN '+@cr; 
-- Gap is found. Use as next available integer unless... 
SET @s=@s+' (SELECT MAX(X)+1 FROM Gt) ELSE Qi END'«Gcr; 
-- . no numbers within the range are in use. 
SET @s=@s+'SET @i=case WHEN Qi IS NULL THEN ' 
«CONVERT (varchar (12) ,@n1)+@cr 
-- . the last number in the range will be used. 
SET @s=@s+'when @i='+convert(varchar(12) ,@n2)+' then -@i '+@cr 
-- . no numbers аге left in the range. 
SET @s=@s+'when @i>'+convert(varchar(12) ,@n2)+' then Ø '+@cr; 
SET @s=@s+'else @i end'+@cr 
SET @s=@s+'SET @x=Gi' 


-- Print the results. 

DECLARE (rc int,@x int,@Parms nvarchar (10) 
SET @Рагтѕ='@х int Output' 

EXEC sp_executesq] (s, @Parms,@x=@rc Output 
SELECT @rc as TheMissingInteger 


in about 12 seconds. These results 
indicate that a range in the low 
hundreds of thousands is the most 
efficient. 

There are many ways you 
can adapt GenerateGaps.sql and 
FindGap.sql to meet your needs. 
For example, you could adapt them 
to use sequences in which the inte- 
gers have differences larger than 
one (e.g., even or odd integers) by 
increasing the difference between 
consecutive integers. You could use 
a numbering scheme that includes 
duplicate integers by incorporating 
a summary view that changes 
the duplicate integers to unique 
integers. 

Instead of using FindGap.sql 
as separate script, you could use 
the code as part of another script 
or convert it to work as a stored 
procedure with an output param- 
eter. If you don't want to use a table 
to provide the range values, you 
could assign @n1 and @n2 values 
that reflect the desired range and 
remove the statement that selects 
the range values from the range 
table. As you can see, the possibili- 
ties are virtually endless. ፳፲ 
— Коу Byrd, principal consultant, 

Byrd Associates 
InstantDoc ID 99797 
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the past few years, Microsoft has re- 
| n leased several business intelligence (BI) 
products, such as SQL Server 2005, the 
Microsoft Office 2007 suite, Microsoft Office Share- 
Point Server (MOSS) 2007, and Microsoft Office Per- 
formancePoint Server 2007. However, it wasn't until 
recently that companies began to feel business pres- 
sure to think about how their bottom line is affected 
by the quality and timeliness of the information that 
their employees use to make decisions. Microsoft and 
its competitors offer BI products to fill this need, and 
Microsoft touts its ability to provide an end-to-end 
BI solution based on its SQL Server data platform. 
With SQL Server 2008's enhanced BI features, many 
companies are considering implementing Microsoft's 
BI platform to improve access to meaningful data. To 
gain insight into how organizations decide whether to 
deploy Microsoft's BI solutions and to see how these 
solutions can be implemented, I spoke with Ron Van 
Zanten, the directing officer of BI at Premier Bank- 
card, to get an inside look at how he implemented 
Microsoft's SQL Server 2005 BI platform in his or- 
ganization, including how long it took to get the data 
warehouse up and running and the challenges that he 
faced during the implementation. 


Bearly: What made your company decide to 
implement Microsoft's BI solution? 


Van Zanten: We were like a lot of companies are 
when they're starting out. We had reporting, and indi- 
vidual divisions had systems on which they would keep 
their data. We did have a data warehouse at the time. It 
was good except for two flaws—the information wasn't 
timely or accurate. We needed to break down the si- 
los of information and get one version of the truth, so 
that everybody was working off the same set of books. 
That's why we chose a centralized data repository. 


Ng 


Bearly: Did your company look at alternative BI 
platforms before deciding to use Microsoft’s? 


Van Zanten: Yes, we looked at Oracle and Tera- 
data. Teradata had only a limited number of instal- 
lations, it’s extremely expensive, and they wanted to 
have their consultant come in and set up the data 
warehouse, set up the schema to one of the sche- 
mas they have for financial institutions, and then 
build reports off of it. The downside of that is that 
we wouldn't have the ownership for the implementa- 
tion. Oracle didn’t have a BI stack and told us to use 
Analysis Services for the cubes. So it looked like we 
were going to use the top part of the Microsoft stack 
anyway. Since Analysis Services was in all three pro- 
posals, we decided to use Microsoft’s BI platform. I 
didn’t have any reservations with SQL Server. I knew 
we could build a schema and leverage SQL Server’s 
strengths and make it perform like we needed it to. 
As long as we started from the ground up, built it and 
designed it with performance in mind, and put a pre- 
sentation layer on the top of it, we were certain SQL 
Server would be able to perform. 


Bearly: Which Microsoft BI products are you using? 


Van Zanten: SQL Server 2005 SP2. We're running 
the latest version of Reporting Services [2005 SP2], 
and Analysis Services has the latest service pack [2005 
SP2] on it, too. We're running PerformancePoint 1.0, 
which is the first build. We can't wait for Performance- 
Point 2.0. We use SharePoint 2007 and we did use 
ProClarity 6.1, but we're migrating over to Perfor- 
mancePoint for that presentation layer. 


Bearly: What features made you use those products? 


Van Zanten: The partitioned tables were something 
that we really wanted, and we really wanted our data- 
base platform to be NUMA [Non-Uniform Memory 
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Access] aware. NUMA is a hardware feature where 
we have four cells on our Itanium box, and those cells 
have four processors and 16GB of memory on each 
one. SQL Server 2005 is aware of that, so we don’t 
allow people to take more than one cell of resources 
when they’re running their queries. We have one fact 
table that’s 2TB; it’s got almost three billion records 
in it. You could write a query to our data warehouse 
and its hardware, and SQL Server keeps a query like 
that from overrunning everything else. The query will 
go through, and we'll eventually find it and be able to 
stop it before it wastes everybody’s time. But in the 
meantime, the SQL Server instance is still available 
and still has a decent response time. 

We monitor our data warehouse by response time. 
We run a query against our Account dimension; we 
query the dimension table, returning one record back. 
We track the time in milliseconds that it takes to log 
in, return the data, and log out. We do this every 10 
minutes. I use that metric above all others because that 
really tells me how SQL Server feels about things. If it 
has CPU pressure, that query will be slower; if there’s 
a lot of memory pressure it’s slower; or if there's disk 
pressure it's slower. If the response time starts to de- 
grade, then I can see where the bottleneck is because 
it's usually one of those three things that the SQL 
Server system is unhappy about. Our standard is that 
a query needs to happen in 250 milliseconds on the 
SQL Server system. 


Bearly: You mentioned that you have a centralized 
data repository. Can you tell me more about your data 
warehouse? 


Van Zanten: We created a centralized data ware- 
house, where everything is stored and loaded through 
a standardized ETL [extraction, transformation, and 
loading] process. The ETL process is where the data 15 
validated and cleaned. Our data warehouse is a rela- 
tional star—it's actually a snowflake schema—where 
the different fact tables and different core areas of 
our business can be related to each other correctly. 
We look at letters that are sent to the customer, phone 
calls the customer makes to us, and the problems that 
they have, in the same context as other things that 
are happening in our company, so that we can get a 
360-degree view of the customer. But that really only 
works well when you have a centralized data store. It 
does cause new problems because you're inviting new 
challenges. One of those challenges is size, by having 
everything in one area—but we have a 12TB SQL 
Server instance that runs this data store. 
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Bearly: Can you tell me more about that SOL 
Server instance? 


Van Zanten: It's 1218, and it contains over 40 bil- 
lion records. If you think about it in the big picture, 
it actually makes our data stores smaller because we 
aren't keeping separate copies of data; everything 
is located in one spot. If you want to get monetary 
transactional history, such as payments, purchases, or 
fees, you can go to one place that's been audited, bal- 
anced, and built on a repeatable process that's gone 
through change control. You have a place where you 
can trust the information. Reports can then be built 
on the information, and our business users can build 
data marts and subject areas to drill down further and 
do further analyses, but it's always based on the origi- 
nal version of the truth. It gives everybody a point of 
reference for reports, and analysts can drill back and 
verify such information as "Where's the baseline? Do 
I balance? Is my report correct?" 


Bearly: How difficult was it to implement 
Microsoft's BI platform? 


Van Zanten: The most important thing was under- 
standing the platform, in order to leverage it. We had to 
have the right information but it also had to be timely. 
ETL performance was definitely a consideration. Our 
ETL process runs through SSIS on a separate server, 
and we bulk INSERT into the actual database instance 
all of the data. 

Having our data warehouse and SSIS on the same 
SQL Server instance probably wouldn't have been the 
best scenario, because both of those products love 
memory, and we didn't want to have that kind of con- 
tention. So having a dedicated ETL box was impor- 
tant for us, although it’s something that companies 
sometimes don't do because they think that if they go 
with the same vendor, they can put it all on one box. 
But if we had purchased a separate ETL tool from a 
third-party vendor, we would have had it installed on 
a separate server anyway. So we took best practices for 
the industry and applied them to our stack. 

We have a separate SSIS server whose dedicated job 
is to manage the data movement, the data quality, and 
the checks, which is a lot of work because we actually 
take in data and put it in a dirty table, and then we 
start to normalize it to get it ready to put into our data 
warehouse. We put the keys in there for our dimension 
tables on the fact tables, and we standardize data types. 
We're looking for things that are out of the ordinary. 
We put 20 to 25 million records in the data warehouse 
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per day, but we inhale into our ETL server well over 30 
million. Then we get the data distilled down, cleaned, 
and standardized. The data warehouse then accepts 
this new information and the ETL box will kick off 
Reporting Services schedules. 

Of 180 loads, about 120 of them are just daily 
incrementations that run overnight. They contain the 
activity for the previous day. We only post transac- 
tions to credit cards at night, so that load only has 
to happen once every evening and then it's done for 
the day. The data isn't going to change during the day 
in the data warehouse. Then we kick off Reporting 
Services scheduled reports and they run against that 
data, using snapshot reports, so that when people 
look at those reports during the day in Reporting Ser- 
vices, the query has already been run against the data 
warehouse. Users think performance is extremely fast 
because the only thing Reporting Services has left to 
do is just to render the data. Reporting Services can 
render that data to 500 people quickly because it's do- 
ing some web pages without having to touch the data 
warehouse itself—because we know in these particu- 
lar reports that the data hasn't changed, which saves 
the data warehouse from having repeating hits from 
all those reports. 

Analysis Services also improves BI performance. 
Our Analysis Services server is separate, which pro- 
vides some relief for the data warehouse. АП the hits 
are done by the Reporting Services and Analysis Ser- 
vices servers. There are also some loads to the data 
warehouse that occur every 15 minutes or every half 
hour for load frequencies, and the data warehouse 
has to service them. 

We use and leverage Microsoft's stack pretty heav- 
ily. Every database engine has things it's good at and 
things it's not good at. A good DBA, a good schema, 
and a good process should leverage a database en- 
gine's strengths. That's what we do with SQL Server. 
It performs very well—queries take less than a minute 
on average, and we never have a query that takes over 
the SQL Server box. The data warehouse takes abuse 
constantly and it performs very well. 


Bearly: Did you implement your BI platform 
inhouse? Or did you contract with someone or work 
with Microsoft directly to set it up? 


Van Zanten: We did it ourselves. 


Bearly: What was the biggest technical hurdle that 
you had to overcome? 


Van Zanten: When we were implementing it, our 
ETL process was done in DTS (we used DTS for 
order of precedence and dependencies), but almost 
all of the heavy lifting was done by SQL Server and 
stored procedures. In that space, the biggest technical 
hurdle was data quality and getting things married 


up. DTS wasn't nearly as robust as SSIS is at sorting 
and weeding things out, so we had to do it by hand. 
You had to run SQL queries when cleaning queries, 
and look for inconsistencies. 


Bearly: Did you encounter any problems other than 
with DTS? 


Van Zanten: The quality of the data and the time- 
liness of our data sources is our biggest challenge. 
The other big challenge is having people understand 
the data warehouse—what's in there, what's not in 
there—as well as understand our process, such as 
knowing how to use Reporting Services and under- 
standing how to pivot a cube. Cubes seem to be a leap 
for people. We could have the best data warehouse 
ever, but if nobody ever used it for their work, then it 
would be a failure. 


Bearly: How long was it from when you started 
building your data warehouse to when it was functional 
for your users? 


Van Zanten: Four months. The first high-value 
test reports were published in six months. 


Bearly: Were you able to stay within the budget you 
were given? 


Van Zanten: We were definitely within budget. 
Our first major purchase was the HP Itanium 8620. 
And we had a very small staff; it was actually just me 
the first year. I was the whole development staff on 
this project. Even now my staff includes only eight 
people who are dedicated to the BI data warehouse 
infrastructure, so it’s still fairly economical to run if 
you set it up correctly. The really giant purchase we 
made was the HP StorageWorks XP24000, because 
it was a seven-figure purchase. But we just bought 
that last year, so by that time we were a billion-dollar 
company, and all this corporate data was giving us a 
700 percent return on investment; we could afford it. 
We were able to deliver, which is huge, and our data 
repository was delivered on time and correctly. It's 
really important to think about how you're going to 
show your return on investment. 


Bearly: Are you planning to migrate to SQL Server 
2008 when it's released? 


Van Zanten: We have SQL Server 2008 on a 
development box and for our test data warehouse. 
Our largest fact table is 2TB; but 700GB would 
likely go away if we could take advantage of SQL 
Server 2008's compression. When SQL Server 2008 
is released, we plan to upgrade our production data 
warehouse. 500 
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JOB TRACKING 


Track down and solve job-scheduling 
problems using two helpful scripts 


Server jobs are useful for 
automating and scheduling 
the execution of database pro- 
grams. The SQL Server Agent service is responsible 
for tracking job schedules and launching SQL Server 
jobs on time. Yet sometimes jobs might not start 
according to schedule. Compounding this problem is 
that SQL Server doesn't report dates and timestamps 
for scheduled jobs that didn't occur, so it's hard to find 
this information. Another problem occurs when a SOL 
Server job has multiple schedules that reference dates 
and times that are the same or close to one another. 
This situation could result in a job being called multiple 
times within seconds, which could cause execution 
errors. 

To solve these problems, Гуе written two T-SQL 
scripts, JobScript FindMissedJobs.sql and JobScript_ 
FindConflictingJobSchedules.sql, which find missed 
jobs and identify conflicting schedules. (You can 
download these scripts on this articles web page at 
InstantDoc ID 99593; click the Download the Code link 
to download a zip file containing these scripts.) Let's 
look at the job-scheduling problems more closely and 
explore how the scripts work. 


Job-Scheduling Issues 

One obvious cause of а job’s failing to start is when SQL 
Server Agent isn't running. A more interesting example 
can occur with long-running jobs. Say that a job is 
scheduled to run at datetimel, then again at datetime2. 
If the job starts running at datetime] and doesn’t finish 
executing by datetime2, the job won't fire at datetime2. 
After datetime2, the job will be called again at the next 
scheduled date and time. So, a key problem with job 
scheduling is not being able to find dates and timestamps 
of missed jobs (1.е., scheduled jobs that were skipped). 
SQL Server doesn't report this information; nonetheless, 
it's crucial data for developers and DBAs to know to 
ensure that database programs indeed run on time. 
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Another important problem is conflicting schedules. 
If one SQL Server job has two or more schedules, it's 
possible that different schedules reference the same (or 
approximately the same) dates and times. When this hap- 
pens, a job will be called multiple times within a couple 
seconds, which could result in logical and functional exe- 
cution errors. For example, assume that the first step of a 
job creates a table called msdb.dbo.utbJobExecution. If 
the first step is invoked twice simultaneously, one of the 
two running instances will result in an error because the 
table already exists in the database. Consequently, users 
must be able to tell whether scheduling overlaps exist and 
prevent such conflicts. 


How SQL Server Stores 

Job Information 

The properties and attributes of all SQL Server jobs 
on a server are stored in the msdb database. The dbo 
.Sysjobs table is available in both SQL Server 2005 and 
SQL Server 2000 (although the table schema 
has changed in 2005) and contains one row 
per job. The sysjobs table holds the fol- 
lowing information: job name, а UNIQUE- 
IDENTIFIER column called job id, which 
stores the ID of each job; an owner sid column of 
type varbinary(85), which captures the ID of the login 
owning the job (as it appears in the master.dbo.syslo- 
gins table); the enabled bit flag column, which indicates 
whether a job is enabled or disabled; and the date cre- 
ated column, which holds the job-creation date. 

SQL Server supports many scheduling options. In 
SQL Server 2000, scheduling data is stored in the dbo 
.Sysjobschedules table. Job schedules in SQL Server 
2005 are split between two tables, dbo.sysjobschedules 
and dbo.sysschedules. The dates when jobs are invoked 
(denoted by run dates) are determined by the freq type 
column (in sysschedules in SQL 2005 and sysjobsched- 
ules in SQL Server 2000). You can configure jobs to 
run once at a given date and time, every few days, every 
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TABLE |: Information in msdb That Dictates When Jobs Are Run 


Description freq_interval 

Runs once at a given date NA 

(denoted by А) 

Runs every /. days, L, where / <= 366 

starting at a given date К 

Runs every M weeks on Day vector: 

specific days, starting at Sum of 1 for Sunday, 2 for 

a given date К Monday, 4 for Tuesday, 8 for 
Wednesday, 16 for Thursday, 
32 for Friday, 64 for Saturday 

Runs on the Mth дау N, N between 1 and 31 

of the month, every P 

months, starting at a 

given date К 

Runs either on the first, Day type: 


second, third, fourth, or 
last day, every P months. 
The day can be selected 
as a specific day of the 
week, day in the month, 

a business day, or a 
weekend day. The starting 
date is denoted by К. 


1 = Sunday 
2 = Мопдау 
3 = Tuesday 


4 = Wednesday 
5 = Thursday 


6 = Friday 

7 = Saturday 

8 = day of month 
9 = business day 


10 = weekend day 


several weeks on certain days of the week, or monthly. 
Table 1 summarizes these options. In the span of each 
run-date day, the runtimes determine the timestamps 
for invoked jobs. A job can either fire once per day, 
per schedule at a user-provided time or run repeatedly 
every few minutes or hours, as Web Table 1 (www 
-sqlmag.com, InstantDoc ID 99593) shows. 


The Problem... and Its Solution 
To find missed jobs or conflicting schedules within a 
given time period, you must first be able to tell when 
the jobs are supposed to run, which isn't an easy task. 
After obtaining the scheduled run dates and runtimes, 
it's a simple matter to use the job-execution history 
stored in the dbo.sysjobhistory table to find scheduling 
conflicts or correlate the calculated dates and times, 
then find all the missed jobs. 

To locate missed jobs, you can use the JobScript - 
FindMissedJobs.sgl script. JobScript_FindMissedJobs 
.sql accepts two parameters: a start date time and an 
end date time parameter. When this script is executed, 
it finds all dates and times in which jobs were supposed 
to run in the given time window, then compares this 
information with the job-execution history. If any jobs 
didn't run on time, the script records and plots all the 
missed jobs. To identify conflicting schedules, you can 
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freq recurrence - 
factor 


run the JobScript_FindConflictingJobSchedules.sql 
script, which also accepts start date-time and end date- 
time parameters and reports all conflicting schedules in 
the provided time range. 

These scripts let users enter the start date time and 
end date time parameters for the time window to be 
examined as the variables @StartDateTimeToCalcu- 
late and @EndDateTimeToCalculate, respectively, as 
you can see at callout A in the sample script in Listing 
1. (The code in Listing 1 is an example containing 
lines of code from both the JobScript FindMissedJobs 
sql and JobScript FindConflictingJobSchedules.sql 
scripts; it's meant to illustrate the scripting techniques 
I discuss in the article.) Then, the scripts take the fol- 
lowing actions to extract the run dates and runtimes 
for all SQL Server jobs. First, a snapshot of all job 
and scheduling information is saved into a temporary 
table called #JobScheduleInfo. Using the user-provided 
time range, each script calculates an effective time 
window for every SQL Server job. The script then loops 
through all jobs and schedules to determine the run 
dates in the effective time window, as I explain in more 
detail shortly. The scripts store the calculated dates 
in a temporary table named #JobScheduledDates. 
Next, for each run date, the script finds the runtimes 
and stores all dates and times in the temporary table 
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#JobScheduledDatesAndTimes. The 
scripts then obtain the missed jobs and 
conflicting schedules data according to 
the information in the #JobScheduled- 
DatesAndTimes table and return this 
information to the user. 


Narrowing the Time 
Window 

Let’s take a closer look at how the 
scripts work. Assume that the user 
enters 1/1/1900 for the start date and 
1/1/2500 for the end date. Clearly, 
there’s no need to reconstruct the job 
runtimes for the entire time range 
because a job couldn’t have run before 
SQL Server was conceived or in the 
future. To optimize performance, the 
scripts first calculate an effective time 
window that looks only at relevant 
dates for each job. When the job is 
initiated, the job’s creator enters the 
active_start_date and active_start_time 
information columns (in the syssched- 
ules table in SQL Server 2005 and the 
sysjobschedules table in SQL Server 
2000). A job clearly cannot run before 
the date and time specified in these 
columns. Moreover, the date_created 
column in the sysjobs table stores the 
datetime value of the job-creation date, 
and it’s obvious that jobs can’t run 
before they were created. 

For the end date, the active_end_ 
date and active_end_time columns (in 
sysschedules and sysjobschedules and 
in SQL Server 2005 and SQL Server 
2000, respectively) store the expiration 
time after which jobs cease to run. 
The scripts take these facts into con- 
sideration and calculate the effective 
minimum and maximum datetimes 
in the columns min_datetime_to_ 
consider and max_datetime_to_consider 
in the #JobScheduleInfo table. The 
effective time window is therefore the 
gap between these two datetime col- 
umns, as callout B in Listing | shows. 


Calculating Run Dates 
and Times 

After the scripts calculate the effective 
time window, they'll proceed to calcu- 


late the run dates for all jobs on the server. Jobs sched- 
uled to run once will fire one time at the date and time 
specified in the active_start_date and active_start_time 
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SQL SERVER JOB-TRACKING SCRIPTS 


LISTING 1: Modified Version of Job-Tracking Scripts 


DECLARE @StartDateTimeToCalculate DATETIME 
DECLARE GEndDateTimeToCalculate DATETIME 
DECLARE @SQLStr NVARCHAR(4000) 


(A)sET @StartDateTimeToCalculate = '1/1/2005' 


SET @EndDateTimeToCalculate = '1/1/2010' 


IF OBJECT ID('tempdb..£JobScheduleInfo') IS NOT NULL 
DROP TABLE £JobScheduleInfo 


CREATE TABLE £JobScheduleInfo ር 
job id UNIQUEIDENTIFIER, 
date created DATETIME, 
min datetime to consider DATETIME, 
max datetime to consider DATETIME) 


IF CAST(LEFT(CAST(SERVERPROPERTY ር ProductVersion') AS NVARCHAR(128)), 2) AS 
DECIMAL) « 9 
SET @SQLStr = ' 
INSERT INTO £JobScheduleInfo 
SELECT азор ак 
a.date_created, 
DATEADD(ss, 
CAST(b.active start time/10000 AS INT)*3600 
+ (CAST(b.active start time/100 AS 1МТ)%100) “60 
+ (b.active start time % 100), 
CAST(CAST(b.active start date AS VARCHAR(8)) AS DATETIME)), 
РАТЕАОО (55, 
CAST(b.active end time/10000 AS INT)*3600 
+ (CAST(b.active end time/100 AS INT)%199) *60 
+ (b.active_end_time % 100), 
CAST(CAST(b.active end date AS VARCHAR(8)) AS DATETIME)) 
FROM msdb.dbo.sysjobs a 
INNER JOIN msdb.dbo.sysjobschedules b 
ON a.job id = b.job id 
AND a.enabled = 1 
AND b.enabled = 1 ‘ 
ELSE 
SET asQEStr = * 
INSERT INTO £JobScheduleInfo 
SELECT a.job id, 
a.date created, 
DATEADD(ss, 
CAST(b.active start time/10000 AS INT)*3600 
+ (CAST(b.active start time/100 AS 1МТ)%100) “60 
+ (b.active start time % 100), 
CAST(CAST(b.active start date AS VARCHAR(8)) AS DATETIME)), 
DATEADD(ss, 
CAST(b.active end time/10000 AS INT)*3600 
+ (CAST(b.active end time/100 AS INT)9109)*60 
+ (b.active end time ?; 109), 
CAST(CAST(b.active end date AS VARCHAR(8)) AS DATETIME)) 
FROM msdb.dbo.sysjobs a 
INNER JOIN msdb.dbo.sysjobschedules z 
ON a.job id = z.job id 
INNER JOIN msdb.dbo.sysschedules b 
ON z.schedule id - b.schedule id 
AND a.enabled = 1 
AND b.enabled - 1' 


EXEC(@SQLStr) 


-- Find the largest min_datetime_to_consider 

UPDATE £JobScheduleInfo 

SET min datetime to consider = CASE WHEN date created « @StartDateTimeToCalculate 
THEN @StartDateTimeToCalculate ELSE date created END 

WHERE min datetime to consider < CASE WHEN date created < 
@StartDateTimeToCalculate THEN @StartDateTimeToCalculate ELSE date created END 


-- Find the smallest max datetime to consider 
UPDATE #JobScheduleInfo SET max datetime to consider = @EndDateTimeToCalculate 
WHERE max datetime to consider » GEndDateTimeToCalculate 


-- Set the time window to zero for all jobs with bogus configuration 
UPDATE #JobScheduleInfo SET max datetime to consider = min datetime to consider 
WHERE max datetime to consider «- min datetime to consider 


columns. For all other scheduled times (i.e. daily, 
weekly, or monthly jobs), finding the run dates is more 
complicated. 

To demonstrate this fact, consider the following 
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LISTING 2: Determining Whether a Day 
Is a Weekend or a Business Day 


DECLARE GDate DATETIME 
SET GDate = '6/23/2007' 


IF CCC&GDATEFIRST - 1) + (DATEPART(weekday, @Date) - 1)) % 7) 


IN (5, 6) 


PRINT('The date ‘ + CAST(GDate AS VARCHAR(16)) + ‘ is a weekend day’) 


ELSE 


PRINT(‘The date ‘ + CAST(@Date AS VARCHAR(16)) + ‘ 
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is a business day’) 


example. A weekly job is created on March 1, 2007 
(Thursday), at 3:00 РМ. and 15 scheduled to run every 
three weeks on Sunday and Thursday, once a day at 
11:00 a.m. After you’ve created the job, several ques- 
tions immediately come to mind. Since the job was 
created at 3:00 p.m. and is supposed to run at 11:00 
АМ., Will SQL Server Agent consider March 1 85 a 
run day (in which case the next run date is three weeks 
from today), or will it fire the job next Thursday? Will 
the Agent pick the next Sunday as a run day, or does 
it consider the last Sunday as a run day? Fortunately, 
there are relatively easy ways to reconstruct the run 
dates. 

The sysjobschedules table has two columns that 
contain the next date and time when a job will run: 
next_run_date and next_run_time. These columns are 
updated (for all jobs) when SQL Server Agent is started, 
or periodically every 20 minutes, as explained in SQL 
Server 2005 Books Online. In other words, the values 
stored in next_run_date and next_run_time can be stale 
or might not be available at all (e.g., from the time that 
a job is created until SQL Server Agent updates the 
next-run values for the new job). However, for purposes 
of finding the job run dates, these issues are irrelevant; 
the next_run_date (even if stale) gives you a date when 
you know that the job is supposed to run. When the 
job_run_date isn’t available, it means that the job was 
created in the last few minutes, in which case you aren't 
concerned about missed schedules. 

When the scripts are run, the next_run_date is 
recorded from sysjobschedules for each job and 
schedule. To get the most up-to-date values for the 
next_run_date, the scripts also call the (undocumented) 
extended stored procedure master..xp_sqlagent_enum_ 
jobs, which queries the SQL Server Agent to return the 
latest data. If the returned next-run values are later 
than those in sysjobschedules, the script overwrites the 
previously recorded values. 

Once the scripts know that a job is scheduled to run 
at a certain date, they can extrapolate all the run dates 
for each schedule. Starting at the date in next_run_date, 
the script adds the appropriate frequency interval (1.е., 
days, weeks, months) and records all run dates in the 
#JobScheduledDates table, until the datetime value 
in max_datetime_to_consider is reached. Next, the 
script returns and determines all run dates between 
next_run_date and min_datetime_to_consider. 


After obtaining the run dates, you need to cal- 
culate the times when jobs run in each run date. To 
do so, you start with the active_start_time: If a job 
is scheduled to run once a day, the active start time 
is the runtime. If jobs run every several (say x) 
minutes or hours, you add x minutes (or hours) 
to the active start time until the end of the day 
15 reached. 


Finding Missed Jobs and 
Conflicting Schedules 

The run dates and times are stored in the #JobSched- 
uledDatesAndTimes table. To find missed jobs, the 
JobScript FindMissedJobs.sql script compares the 
content of this table with the values in sysjobhistory to 
check whether the first step was invoked on (or a few 
seconds after) the scheduled date and time. You can 
now easily obtain conflicting schedules by comparing 
the dates and times in #JobScheduledDatesAndTimes. 
If two run dates and times are only several seconds 
apart (the number of seconds is configurable and is 
stored in the variable @TimeGapInSeconds), then the 
schedules indeed conflict. 


Special Situations 

Obtaining the run dates can still be challenging for 
some schedules, and the scripts address such cases. 
One such example is when, every few months, a job 
is scheduled to run on the third or last weekend day. 
The code to determine whether a day is a weekend 
or a business day depends on the @@DATEFIRST 
settings; Listing 2 shows an example of this code. The 
code in Web Listing 1 returns the nth day of the month 
(where ዘ 1s the freq relative interval value in Table 1 
that corresponds to jobs with freq. type = 32 and the 
freq interval is either 9 or 10). 


Quick Job Tracking 
You can also query SQL Server Agent scheduling 
information by using Windows Management Instru- 
mentation (WMI) or another scripting language 
besides T-SQL. The main advantage of my scripts, 
though, is that they can return missed jobs and 
conflicting schedules quickly, using simple T-SQL 
code, even when many SQL Server jobs are on the 
server. However, the scripts have a couple potential 
limitations. If you run the scripts against a large 
time window (e.g., several years), they might take a 
while to run. Additionally, the scripts don't consider 
scheduling changes. If a schedule changes at some 
point in time, the information reported for the dates 
prior to the change might be incorrect. If you keep in 
mind these minor caveats, the scripts should help you 
improve your ability to keep a closer watch on SQL 
Server job scheduling. ፳፲ 
InstantDoc ID 99593. 
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FILTERED INDEXES 
STATISTICS 


Enhance your queries and obtain 


more useful data 


S L Server 2008 introduces a very cool new 

feature—filtered indexes and statistics. 
This new feature lets you improve the performance of 
your queries without wasting disk space; offers more 
detailed and more accurate distribution statistics, 
which provide better selectivity estimates from the opti- 
mizer; and provides an alternative to the commonly 
requested unique constraint with multiple NULLs. ГЇЇ 
use several examples to introduce you to the concept 
of filtered indexes and statistics and to demonstrate 
their use. 


Filtered Indexes and Statistics 

In SQL Server 2008, you can define nonclustered 
indexes and statistics on a subset of rows rather than 
having to create them on the whole set of rows from a 
table. You achieve this task by specifying a predicate in 
a WHERE clause as part of the CREATE INDEX or 
CREATE STATISTICS statement. For example: 


CREATE INDEX idx1 ON dbo.T1(keycol) WHERE coli 
=: 


The SQL Server 2008 implementation lets you specify 
fairly simple predicates. You can use IN; the com- 
parison operators IS NULL, IS NOT NULL, =, <>, 
Iz, >, >=, !>, <, <=, Ic; and the logical operator AND. 
We can assume that Microsoft will continue to expand 
this list in future releases. Also, for now, filtered indexes 
cannot be created on indexed views, or using computed 
columns, user data types (UDTs), HIERARCHYID, 
or spatial types. 

The ability to define filtered indexes provides sev- 
eral benefits when you frequently query specific subsets 
of rows. Compared with creating ordinary indexes, cre- 
ating indexes on a specific subset of rows results in sav- 
ings beyond the obvious disk space saved. For example, 
only changes in the subset of rows in the underlying 
table will involve modifying the corresponding index 
rows. Index maintenance operations such as index 
rebuilds and reorganizations will naturally be faster 
and cheaper. Consider also the fact that the maximum 
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number of steps represented in distribution statistics 
(histograms) is limited. Because a filtered index rep- 
resents only a subset of the rows from the table, each 
step in the histogram represents fewer rows compared 
with a nonfiltered index. As a result, filtered indexes 
and statistics are more accurate. 

A scenario in which you might find filtered indexes 
useful is when you need to index only known values, 
but the column you want to index has a large number 
of NULLs and you never need to filter NULL values. 
Typically, every row from the table has a corresponding 
row in the index, including when the index key is 
NULL. But if those NULLs are not relevant to your 
queries, you will benefit from defining the index on the 
subset of rows with only the known values in the index 
key column. As an example, suppose that you want to 
create an index that will support queries against the 
Sales.SalesOrderHeader table in the AdventureWorks 
database, filtering a known currency rate ID. You 
can create an index on the subset of rows that have a 
known value in the CurrencyRateID column like so: 


USE AdventureWorks ; 


CREATE NONCLUSTERED INDEX idx currate notnull 
ON Sales.SalesOrderHeader(CurrencyRateID) 
WHERE CurrencyRateID IS NOT NULL; 


Here's an example for a query (call it Query 1) that 
filters all rows with a certain known currency rate ID: 


SELECT * 
FROM Sales.SalesOrderHeader 
WHERE CurrencyRateID - 4; 


Because the optimizer is smart enough to realize 
that this filter is supposed to exclude NULLs, it will 
consider using the index idx currate notnull. Then, 
whether the index is used depends on the filter’s selec- 
tivity. As the execution plan for this query shows (see 
Figure 1, page 22), the index was used for the query. 
When you use a range filter predicate to create a 
filtered index, the optimizer considers using the index 
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Execution plan for 


even when the query requests a subinterval of the 
Query | 


range. As an example, the following index is defined 
on the Sales.SalesOrderHeader table, with the Freight 
column as the key, based on the predicate Freight >= 
$5000.00: 


CREATE NONCLUSTERED INDEX idx freight 5000 or 
more 
ON Sales.SalesOrderHeader(Freight) 
WHERE Freight >= $5000.00; 


The following query (call it Query 2) demonstrates a 
request for a subset of rows from the table based on 
a subinterval of the range defined in the filtered index 
idx freight 5000 or more: 


SELECT * 
FROM Sales.SalesOrderHeader 
WHERE Freight BETWEEN $5500.00 AND $6000.00; 
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Figure 2 

Figure 2 shows the plan for this query. As you can see 
in the execution plan, the index idx_freight_5000_or_ 
more was used. 

Filtered indexes can also have an INCLUDE 
clause, which is useful when you want the index to 
cover certain queries and thus avoid lookups. For 
example, the following code creates an index on 
the OrderDate column; includes the columns Sales- 
OrderID, CustomerID, and TotalDue; and filters only 
rows where TerritoryID is equal to 5: 


Execution plan for 
Query 2 


CREATE NONCLUSTERED INDEX idx_territory5_ 
orderdate 
ON Sales.SalesOrderHeader(OrderDate) 
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Figure 3 


Execution plan for 
Query 3 
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INCLUDE(SalesOrderID, 
CustomerID, TotalDue) 
WHERE TerritoryID - 5; 


#4 


Consider the following 
query (call it Query 3): 


M 
{ү 


57 % SELECT SalesOrderID, 


CustomerID, OrderDate, 
TotalDue 
FROM Sales. SalesOrderHeader 
WHERE TerritoryID = 5; 


Figure 3 shows the execution plan for this query. As 
you can see in the plan, the optimizer uses the index 
idx_territory5_orderdate to provide complete coverage. 
Because the query requests all rows in which the ter- 
ritory ID is 5, the optimizer chooses a full scan of the 
leaf of the index. 

Add to the query a filter based on the index key 
column—OrderDate in our case—as the following 
query shows (call it Query 4): 


SELECT SalesOrderID, CustomerID, OrderDate, 
TotalDue 

FROM Sales. SalesOrderHeader 

WHERE TerritoryID = 5 
AND OrderDate >= ‘20049101’; 


Figure 4 shows the execu- 

tion plan for this query. The 
ol plan shows that the opti- 
mizer efficiently uses a seek 
operation (plus a partial 
scan) against the index idx_ 
territory5 orderdate, 
because the index not only 
covers the query but also 
sorts the data by OrderDate. 
Only the relevant range of rows is scanned within the 
index since OrderDate is the first (and only) key in 
the index. 

Similar to filtered indexes, SQL Server 2008 also lets 
you create filtered statistics. For example, the following 
code creates filtered statistics on the OrderDate column 
in the Sales.SalesOrderHeader table, only for the subset 
of rows from the table where the territory ID is 4: 


50% 


50% 


CREATE STATISTICS stats_territory4_orderdate 
ON Sales.SalesOrderHeader (OrderDate) 
WHERE TerritoryID = 4; 


Filtered indexes can also be used in conjunction with 
another new SQL Server 2008 feature—sparse col- 
umns, which don’t consume 
any storage space for NULLs. 
Sparse columns is a topic for 
another article, but for now ГЇЇ 
just say that in SQL Server 2008 
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Execution plan for Query 4 


Msg 2601, Level 14, State 1, Line 1 


Cannot insert duplicate key row in object ‘dbo.T1’ with unique index 'idx coll notnull'. 


The statement has been terminated. 


Figure 5 


Error message from trying to insert two rows with the same known coll value 


you will be able to create a large number of sparse 
columns (as many as 30,000) and support those 
columns with a large number of indexes/statistics 
(as many as 1,000). 


Unique Constraint with 
Multiple NULLs 

You can define a filtered index as unique if you 
need to. Doing so can help the optimizer and can 
be used to enforce uniqueness. For example, DBAs 
sometimes need to enforce a unique constraint that 
allows multiple NULLs (i.e, prevents duplicates 
in known values, but not with NULLs). Although 
SQL Server doesn't support such a constraint, 
filtered indexes offer an easy alternative. The fol- 
lowing code shows an example: 


IF OBJECT ID('dbo.Tl1', 'U') IS NOT NULL DROP 
TABLE dbo.T1; 


CREATE TABLE dbo.T1(coll INT NULL, col2 
МАКСНАК (10) NOT NULL); 


CREATE UNIQUE NONCLUSTERED INDEX idx coll. 
notnul] 
ON dbo.T1(col1) 
WHERE coll IS NOT NULL; 


As you can see, a unique index is created on 
Tl.coll, with a filter that excludes NULLs. This 
means that uniqueness will be enforced on known 
values but not on NULLs because NULLs aren't 
part of the index. To test the index, try to insert two 
rows with the same known coll value: 


LEARNING PATH 
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INSERT INTO аро.Т1(со11, col2) VALUES(1, ʻa’); 
INSERT INTO аро.Т1(со11, col2) VALUES(1, ʻa’); 


The first INSERT will be successful, but the second 
will fail with the error message in Figure 5. Con- 
versely, you will have no problem inserting multiple 
rows with NULL in coll: 


INSERT INTO dbo.Tl(coll, col2) VALUES(NULL, 'а'); 
INSERT INTO dbo.Tl(coll, col2) VALUES(NULL, 'а'); 


When you're done experimenting with filtered 
indexes, run the following code for cleanup: 


USE AdventureWorks; 

DROP INDEX Sales.SalesOrderHeader.idx currate - 
notnull; 

DROP INDEX Sales.SalesOrderHeader.idx - 
freight 5000 or more; 

DROP INDEX Sales.SalesOrderHeader.idx - 
territory5 orderdate; 

DROP STATISTICS Sales.SalesOrderHeader.stats - 
territory4 orderdate; 

DROP TABLE dbo.T1; 


Benefits 
Filtered indexes and statistics let you optimize your 
queries for certain subsets of rows but don't incur 
the same costs as regular indexes. Filtered indexes 
consume less storage space, are faster and cheaper 
to maintain, and can be used as an alternative to a 
unique constraint that allows multiple NULLs. ፳፻ 
InstantDoc ID 39567. 


"Column Statistics Give the Optimizer an Edge," InstantDoc ID 97809 
"Unique Constraint with Multiple NULLs," InstantDoc ID 98678 
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Data Warehouse 


and 


ata warehouses are evolving from the 

traditional information archive that sup- 

ports business intelligence (BI) users 
and reporting tools, becoming instead true analytical 
repositories that support mission-critical functions 
of the enterprise. My definition of mission-critical is 
any system that supports revenue generation or cost 
control. Should such a system fail, users would have 
to manually perform these formerly automated tasks 
to prevent loss of revenue or increased cost. In light 
of the mission-critical nature of the data warehouse, 
it’s necessary to optimize the design for maximum 
performance. You begin this optimization effort by 
identifying what type of workload your warehouse is 
dealing with. 

In the study “Magic Quadrant for Data Warehouse 
Database Management Systems, 2007,” the Gartner 
Group identified six distinct workload types that typify 
the way organizations use their data warehouses. The 
following are descriptions of the workloads: 

1. Continuous (near real-time) data loading for 
dynamic data elements. This workload creates an 
environment that’s very similar to an OLTP data- 
base workload because indexes and other optimiza- 
tion structures have to be continuously updated. 
Continuous updating can have a profound effect on 
summary and aggregate data, which is used to sup- 
port dashboards and prebuilt reports. 

2. Batch data loading, as organizations realize that 
not all data needs to be refreshed continuously. 

3. Warehouses that support reporting systems and 
large numbers of standard, prebuilt reports (thou- 
sands per day), which require indexing and tuning 
for specific SQL queries, new types of storage 
partitioning, and other types of optimization struc- 
tures in the data warehouse. 

4. Tactical business analytics (as opposed to the tra- 
ditional strategic uses that data warehouses were 
once limited to) in which business professionals 
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Determining how your 
data warehouse Is used 


with little or no programming experience use ana- 
lytical applications to drill down and drill through 
multidimensional objects. Business users rely on BI 
architects to build commonly used cubes for their 
work. 

5. Warehouses that support users of true ad hoc que- 
ries (1.е., data miners), who use the data in random, 
unpredictable ways, making it impossible to opti- 
mize and tune their queries. 

6. Analytic and Bl-oriented functions in conventional 
OLTP databases (a re-emerging trend). 


In my experience, there are also five distinct data 
warehouse use cases (1.e., stereotypes of real conditions 
in which data warehouses are used). Each use case 
includes one or more of the Gartner Group's work- 
loads and has specific infrastructure needs and require- 
ments. By identifying which use case most closely 
matches your data warehouse, you can better 
tune for performance. On the downside, 
if your data warehouse is trying to be all 
things to all people, you might consider split- 
ting the data warehouse to optimize for the 
desired functionality. Here are my five data 
warehouse use cases and the workloads they typically 
support. Once you've identified your data warehouse 
among them, you can then look at optimizing it. 


The Traditional 

Data Warehouse 

The type of structure you think of when you hear the 
name ^Bill Inmon" is the traditional data warehouse. 
(For more information about Inmon, see the web- 
exclusive sidebar “Who is Bill Inmon?” www.sglmag 
сот, InstantDoc ID 99649.) It usually contains very 
large data volumes with daily or weekly data refreshes. 
The scope and breadth of data might be very wide 
and deep because this data warehouse typically serves 
a very large audience. This type of data warehouse is 
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DATA WAREHOUSE WORKLOADS AND USE CASES 


what most organizations start with or migrate to after 
their proof-of-concept data mart project. It’s the con- 
ventional data store used for BI and will support vast 
reporting systems. Its content can help the management 
team determine answers to strategic questions such as 
“Are we doing the best job positioning ourselves in the 
marketplace?” However, if your users need to use the 
data warehouse for tactical or data mining purposes, I 
recommend spinning off individual data marts designed 
for those purposes. Workloads 2 and 3 are consistent 
with what you'd expect to find in a traditional data 
warehouse. 


The Data Mart 

The data mart is typically characterized by its size and 
focus. Data marts tend to house less data than data 
warehouses and to be narrow in scope. They hold data 
for a particular department or function of a company. 
Data refreshes and retention are based on the needs of 
that department or function. It’s not unusual to have 
both real-time feeds and daily or weekly refreshes in a 
data mart because key decisions will be made based on 
the most up-to-date data possible. Data marts are small 
and include any and all kinds of data that’s needed to 
service the data mart consumers, so they’re ideal for 
sourcing dashboards and for supporting analytics and 
data mining because the data can be processed quickly. 
You might find workloads 1, 2, 3, 4, and 5 associated 
with a data mart. 


The Real-Time Data Warehouse 
A real-time data warehouse has continuous data 
feeds and can be used for near-real-time reporting. 
The development and popularity of this type of data 
warehouse has been driven by organizations’ increased 
desire to have the most current, integrated information 
possible at their fingertips. Some of the problems you 
might encounter in a real-time data warehouse include 
constant resource contention between real-time feeds 
and (often ad hoc) queries being made against the same 
set of data objects, daily or even hourly increases in 
data storage, purging of unneeded data, and a commu- 
nity of users that might be very broad or very narrow, 
depending on the type of data contained within the 
data warehouse. Workload 1 is often related to a real- 
time data warehouse. 


The Historical Data Warehouse 

In the past couple of decades, the government has 
mandated that many types of businesses have to keep 
large amounts of historical information (financial 
information, especially) for compliance reasons. These 
mandates have caused an increase in the popularity of 
the historical data warehouse. This type of data ware- 
house contains data volumes that are many times larger 
than that of traditional data warehouses, requiring you 


to pay close attention to storage structures and parti- 
tioning. Data refreshes are semi-frequent (1.е., weekly 
or even monthly), and there’s usually less query traffic 
than with traditional data warehouses, data marts, or 
real-time data warehouses. Queries that are run against 
historical data warehouses are usually well-defined, 
which means you can optimize them for performance. 
Workloads 2 and 3 are consistent with what you might 
find in a historical data warehouse. 


The Analytic OLTP Warehouse 
An analytic OLTP warehouse is one that has been cre- 
ated to be all things to all people. In the old days—when 
hardware was more expensive than а programmer’s 
time—one database was used for both rapid transac- 
tional updates and resource-intensive analytical queries. 
Mixing these two workload types slowed applications 
down and made them non responsive. According to the 
Gartner Group, there seems to be a resurgence of this 
type of architecture, fueled by cheap, powerful hardware 
and lack of experience on the part of both DBAs and 
business decision-makers as data warehousing works its 
way down the economic ladder into the small-to-mid- 
sized business (SMB) market. This architecture always 
has been and always will be a worst practice because 
it’s neither scalable nor extendable. Workload 6 is often 
associated with the analytic OLTP warehouse. 


Workload Mix and Match 

You need to be aware of what types of workloads 
your data warehouse is supporting. According to 
the Gartner Group, “through 2010, mixed workload 
performance will remain the single most important 
performance issue in data warehousing.” As a direct 
result of the need for real-time data loading and auto- 
mated transactions (e.g., sourcing dashboards), “the 
transactional DBMSs may be able to erode the perfor- 
mance edge that was formerly attributed to specialized 
data warehouse DBMS solutions.” In other words, for 
certain applications, such as dashboards, your best 
datasource might be one or more transactional data- 
bases, not your data warehouse. 

Data warehouses are growing in size and complexity 
as more businesses develop and use them. The mix of 
workloads in a single data warehouse is creating more 
problems than the size of the data warehouse because 
one type of workload will have different resource needs 
than other types of workloads. Optimizing for these 
various and contradictory requirements might not be 
possible; if you can’t split the workloads into separate 
data warehouses or data marts, you might have to settle 
for “good enough” or less. By becoming aware of what 
types of data warehouses you have and what types of 
workloads they perform, you can determine how to 
optimize them. ЕП 
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SDB301: SQL AGENT 
ANDREW KELLY 


SQL Server Agent is used by almost every DBA but 
most do not truly understand it's potential. SQL 
Server 2005 along with 2008 has brought some 
great new additions to the SQL Server Agent, which 
makes it even more useful and secure than ever. We 
will explore each of the security roles associated 
with SQL Agent including the proper use of Proxy 
accounts and Credentials that make SQL Agent flex- 
ible yet secure. Several other aspects that you are 
probably under utilizing now including Tokens, 
Alerts, Schedules and more will be covered in detail 
along with demos that highlight each area. 


SDB302: CLUSTERING IN SQL SERVER 2008 
RONALD YENKO 


Have you always wanted to set up a high availability 
SQL Server cluster but thought it was too expensive 
or complicated? In this session, we'll take a look at 
where clustering makes sense (and maybe where 
another approach might fit better). We'll go through 
a methodology on how to design and set up a clus- 
ter (from the relatively economical to mega-power- 
ful. The session will include gotcha's and 
workarounds for smoothly clustering SQL Server 
Integration Services. We'll also cover some tips and 
tricks that will keep your clusters running smoothly 
without constant attention from you. 


SDB203: ARE YOU MISCONFIGURED? 
ANDREW KELLY 


Do you know what options you should change or set 
when installing or configuring a SQL Server 
instance? Is your memory set correctly? How about 
your disk subsystem? What about the placement 
and configuration of the database files? Is your sys- 
tem even scalable? If you aren't sure what you are 
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supposed to touch or why, you need to attend this 
session. We will cover the common configuration 
mistakes with SQL Server and the underlying server. 
There is no "one setting suits all" so we will explore 
when it is appropriate to change a setting from the 
default and why and how we choose a particular set- 
ting. Getting the settings correct may become much 
harder as the database grows in size and complexi- 
ty-let's get it right the first time. 


SDB304: GETTING THE MOST OUT OF THE 
SQL SERVER PROFILER 2005/2008 
BRAD MCGEHEE 


One of the most powerful tools available to the SQL 
Server DBA is SQL Server Profiler. While many DBAs 
are familiar with the fundamentals of this tool, they 
often lack a complete understanding of how they 
can use it to identify and troubleshoot many differ- 
ent types of SQL Server problems. In this session, 
attendees will learn specific techniques that they 
can apply to solving common SQL Server problems. 
These include: slow performing queries, blocking, 
deadlocking, sort warnings, excessive auto stats 
activity, excessive stored procedure compilations, 
excessive table/index scans, and SQL Server Profiler 
best practices. 


SDB305: ESSENTIAL DATABASE 
MAINTENANCE 
KIMBERLY L. TRIPP & PAUL S. RANDAL 


In this session, Paul and Kimberly will run you 
through their top-ten database maintenance recom- 
mendations, with a lot of tips and tricks along the 
way. These are distilled from almost 30 years com- 
bined experience working with SQL Server cus- 
tomers and are geared towards making your data- 
bases more performant, more available, and more 
easily managed (to save you time!). Everything in 
this session will be practical and applicable to a 
wide variety of databases. Topics covered and myths 
debunked include: backups, shrinks, fragmentation, 


statistics, and much more! Focus will be on 2005 but 
we'll explain some of the key differences for 2000 
and 2008 as well. 


SDB306: HOW TO INTERPRET QUERY 
EXECUTION PLANS IN SQL SERVER 
2005/2008 

BRAD MCGEHEE 


One of the black arts of being a DBA is how to read 
and interpret Graphical Query Execution Plans. This 
is a key thing to understand as Execution Plans pro- 
vide valuable clues as to why a particular query may 
be running slowly. In this session, you will learn how 
to read an Execution Plan, and learn what to do with 
the information you have gained from reading it. 


SDB307: MAXIMIZING PLAN RE-USE 
IN SQL 2008 
ANDREW KELLY 


Lack of plan reuse is one of the biggest performance 
killers in SQL Server and almost everyone has expe- 
rienced problems in this area before. We will explore 
exactly how to determine if this is an issue for you 
and more importantly, how to address it. See what 
factors determine if a plan gets reused or not and 
why. We will cover how to issue calls to the database 
that guarantee reuse and peak performance. 
Understanding this aspect of SQL Server is some- 
thing every good DBA should be fully aware of and 
attending this session will get you the insight to get 
started immediately. 


SDB208: TIPS AND TRICKS FOR 
UPGRADING TO SQL SERVER 2008 
RONALD YENKO 


Many organizations are still in the process of 
upgrading from SQL Server 2000 to 2005. When 
looking at going to SQL Server 2008, you may have 
a decision to make-upgrade directly from SQL 2000 
to SQL 2008-or-upgrade to SQL 2005 first, then to 


2008? In this session, we'll look at factors such as 
incompatibility issues (e.g. deprecated functionality, 
security models, etc.), choosing the appropriate 
upgrade mechanism, and rollback planning. These 
can help you chose the best upgrade path for your 
organization. Like so many other things in life, tim- 
ing is everything. A little strategic evaluation and 
planning on your part could result in a 2-for-1 
upgrade process-saving you both time and money. 
It could also result in you saving your company from 
a costly situation in attempting to upgrade to a ver- 
Sion your environment for which it is not ready. Most 
of the evaluations and processes would also apply 
to a SQL 2005 to SQL 2008 migration. 


SDB309: MASTERING THE SQL SERVER 
2008 RESOURCE GOVERNOR 
BRAD MCGEHEE 


For years, | have seen many, many questions in 
forums asking how individual queries or users 
could be restricted from taking more than their 
fair share of server resources. Until SQL Server 
2008, this was not possible. But now, with the help 
of the new SQL Server 2008 Resource Governor, 
DBAs have the ability to allocate SQL Server 
resources among connections. In this session, we 
Will learn about the benefits and features of the 
Resources Governor, take a look at its architecture, 
and learn how to configure and implement it. We 
will also take a look at best practices. 


SDB210: TIPS AND TRICKS FOR AUDITING 
IN SQL SERVER 
RONALD YENKO 


SQL Server 2005 introduced tools to help you audit 
what was occurring in the structure of your data- 
base (e.g. DDL triggers). SQL Server 2008 is intro- 
ducing even more auditing tools-including auditing 
of data changes and even reads. If you were to audit 
everything possible, your system would likely 
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expend more resources auditing than serving cus- 
tomers. In this session, we'll go over the questions 
you need to ask your business to determine what 
level of auditing is appropriate. Then we'll go 
through how to set up many of the common audits 
to meet your business’ requirements. 


SDB311: SAY NO TO MAINTENANCE PLANS 
ANDREW KELLY 


The built in Maintenance Plans are OK to get up and 
running in a hurry but they are far from ideal. There 
are inherent limitations with the implementation 
and in the event there is a problem, troubleshooting 
can be almost impossible. This session will cover in 
detail a set of stored procedures that you can use to 
create your own custom maintenance jobs. You will 
have the flexibility to modify them to suite your 
every need far easier than the Maintenance Plans 
ever could. And since these utilize pure T-SQL they 
are easy to maintain and far easier to troubleshoot 
than the "black boxed" built in maintenance plans. 
Creating your own custom jobs is fast and as a 
result you will have a much better understanding of 
what is being done by each task putting you in a bet- 
ter position to effectively troubleshoot any prob- 
lems that may arise. 


SDB412: DBCC CHECKDB: 
THE DEFINITIVE GUIDE 
KIMBERLY L. TRIPP & PAUL S. RANDAL 


Detecting corruption is critical to limiting down-time 
and data-loss, and DBCC CHECKDB is the way to do it. 
However, to most people it's a black-box tool that 
eats system resources, takes a long time to run, and 
spits out cryptic output. How does it work internal- 
ly? What does it do? When should it be run? What 
does the output mean? Paul wrote DBCC CHECKDB 
for SQL Server 2005 and can explain the answers to 
all these questions. Come to this session to see 
DBCC CHECKDB demystified by the author of the 
code. Although this session is only an hour, Paul and 
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Kimberly will stay through lunch for an open Q&A-so 
go grab your boxed lunch and come back to keep lis- 
tening and learning! 


SDB313: FOLLOW THE RABBIT: 
INTERACTIVE Q&A ON DATABASE 
MAINTENANCE 

KIMBERLY L. TRIPP & PAUL S. RANDAL 


If there's something you want to know about 
Database Maintenance, we've got the answer for 
you! During the week there have been sessions on 
many aspects of database maintenance, but maybe 
YOUR question hasn't been answered. So, bring 
along your questions as we discuss the ins and outs 
of keeping a database healthy. This session might 
not seem as structured as other sessions, but you'll 
be surprised at how informative and fun it is! This 
follows straight on from the DBCC CHECKDB session, 
plus open lunchtime Q&A-we've had such great suc- 
cess with these Q&A sessions that we want to maxi- 
mize your time in them. 


SDB414: PERFORMANCE TUNING USING 
X/EVENTS 
GERT DRAPERS 


When it comes to performance tuning, you want all 
the insights possible in to what SQL Server is doing. 
X/Events is the tooling that will provide you that 
insight. Not only does it provide session-level infor- 
mation about wait states, resource utilization, locks 
etc., it also allows you to correlate this information 
with the operating system. This enables the experi- 
enced performance troubleshooter and tuner to 
identify the bottlenecks inside SQL Server much 
more efficiently. This session will teach you how to 
use X/Event to become an effective performance 
troubleshooter and tuner. 


SQL SERVER DEVELOPMENT 


SDV401: USER-DEFINED TYPES AND 
AGGREGATES IN SQL SERVER 2008 
DON KIELY 


Running .NET code within SQL Server 2008 has 
opened up a lot of amazing possibilities, providing 
flexible options for working with relational data. But 
defining your own user-defined types and aggre- 
gates is easily the most complex and powerful thing 
you can do with SQLCLR code. That power and com- 
plexity puts off some developers, but you shouldn't 
miss out! SQL Server 2008 expands UDTs in great 
new ways, making it all the more important to know 
how to use them. In this session, we'll explore how 
to create custom types that perform well, write 
attributes correctly, and use the right serialization 
formats. We'll also talk about issues you should con- 
sider when designing your own types, as well as 
when it might be better to use other options. 
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SDV302: GROUPING SETS AND MERGE IN 
SQL SERVER 2008 
ITZIK BEN-GAN 


SQL Server 2008 introduces several important new 
T-SQL features, among them enhanced support for 
grouping sets and the MERGE statement. The 
enhanced support for grouping sets allows you to 
develop more flexible and efficient applications that 
support the growing demands for dynamic analysis 
of aggregated data. The MERGE statement allows 
combining INSERT, UPDATE and DELETE logic as a 
single atomic operation to merge contents of a 
source into a target. This session will cover both 
enhancements in detail and demonstrate some of 
their practical uses. 


SDV303: INDEX INTERNALS AND USAGE 
KIMBERLY L. TRIPP & PAUL S. RANDAL 


Indexes are arguably the most important structures 
in a database yet they are often poorly understood 
and neglected. In this session, you'll be reminded of 
the internals of indexes but our focus will be on how 
they are used by the SQL Server engine and what 
you need to do to make sure they're kept in opti- 
mum health. A myriad of tips, tricks, and optimiza- 
tions will be discussed and demo'd so that you 
improve performance immediately. 


SDV304: SECURITY IN THE CLR WORLD 
INSIDE SQL SERVER 
DON KIELY 


One of the major benefits of writing МЕТ code to 
run in the Common Language Runtime in any envi- 
ronment is its code access security (CAS). CAS pro- 
vides a code-based-rather than user-based- 
authorization scheme to prevent various kinds of 
luring and other code attacks. But how does that 
security scheme co-exist with SQL Server's own, 
greatly enhanced security features? By default 
your .NET code is reasonably secure, but it is all too 
easy for the two security schemes to butt heads 
and cause you grief. During this session, we'll 
briefly look at the concept behind CAS, then 
explore how to make it work for you instead of 
against you as you take advantage of these 
advanced programming features in SQL Server. 


SDV305: T-SQL TIPS AND TRICKS 
ITZIK BEN-GAN 


This session will cover T-SQL tips and tricks. It will 
demonstrate how to solve common T-SQL problems 
elegantly and efficiently. The session will cover 
techniques in Microsoft SQL Server 2005, and will 
also introduce tips and tricks that involve new fea- 
tures and enhancements in Microsoft SQL Server 
2008. The purpose of the session is both to demon- 
strate advanced techniques, but also to have some 
fun with T-SQL. 
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SDV306: HANDLING GRAPHS, TREES 
AND HIERARCHIES IN SQL SERVER 2005 
AND 2008 

ITZIK BEN-GAN 


Graphs are specialized data structures that repre- 
sent relationships between pairs of items. Examples 
for graphs include an Employee Organizational 
Chart and a Bill of Materials. This session explains 
how to query graph data stored in SQL Server to 
answer common requests, including: 
= Subgraph, e.g., return all subordinates 
of a given manager 
m Path, e.g., return all management chain 
of a given employee 
m Presentation, е.0., sort employees such 
that a subordinate would be returned 
after a manager 


This session will explain how to handle such 
requests with recursive queries in SQL Server 2005 
and also with the new HIERARCHYID datatype intro- 
duced in SQL Server 2008. 


SDV407: UNDERSTANDING SQL SERVER 
MEMORY MANAGEMENT 
GERT DRAPERS 


There are three key things to understand when it 
comes to SQL Server scalability and performance: 
execution, 1/0 and memory. This session will take you 
through the internals of SQL Server memory man- 
agement in order to gain the necessary knowledge 
to better understand the underlying SQL Server 
behaviors. Once you understand how SQL Server 
allocates and uses its memory, you will increase your 
effectiveness when dealing with performance tuning, 
scaling, and configuration problems. 


SDV308: WHAT'S NEW, HARDER, AND 
EASIER IN SQL SERVER 2008 SECURITY 
DON KIELY 


Each new version of SQL Server has gotten more 
secure and more complex. More secure is better, but 
more complex means that there are likely to be 
whole new ways of attacking it that no one has 
dreamed up yet. During this session, we'll see what's 
new in security and how it makes development eas- 
ier and harder. Overall your applications are likely to 
be more secure, but you'll need to know more than 
ever about security to make it so. 


SDV409: UNDERSTANDING THE 
SQL SERVER EXECUTION MODEL 
GERT DRAPERS 


There are three key things to understand when it 
comes to SQL Server scalability and performance: 
execution, 1/0 and memory. This session will take you 
through the internals of the SQL Server execution 
and scheduling system. This will provide you with the 
necessary data to better understand the underlying 
SQL Server behaviors and enables you to better trou- 
bleshoot your performance and scaling challenges. 


BUSINESS INTELLIGENCE 


SBI201: UNCONVENTIONAL ETL WITH 
INTEGRATION SERVICES 
STACIA MISNER 


The term ETL is typically associated with traditional 
data warehousing in which common data sources 
are flat files or relational databases, but so much 
more is possible with SQL Server 2005 Integration 
Services. In this session, we'll explore some data 
integration scenarios that could be added to your 
ETL repertoire. This session shows you how to go 
beyond traditional relational or flat file sources by 
using SSIS to extract from XML data and free-form 
text and integrate the results into your analytical 
applications. You'll also discover that loading data 
into relational destinations is no longer your only 
option for preparing data for reporting applications. 


SBI202: OPPORTUNITIES FOR DATA 
PROFILING WITH INTEGRATION 
SERVICES 2008 

STACIA MISNER 


Performing data profiling is a labor-intensive and 
time-consuming task that is a process ideal for 
automation. The new Data Profiling Task in 
Integration Services 2008 allows you to put high- 
quality data into new sources in data integration 
projects, ensure quality is maintained by imple- 
menting data quality projects, and explore data in 
data mining projects. Come to this session to learn 
the features of the Data Profiling Task and how to 
use the Data Profile Viewer application. In addition, 
you will learn how to query the output of the Data 
Profiling Task in the control flow of a package. 


SBI203: WHAT'S SO GREAT ABOUT 
REPORTING SERVICES 2008? 
STACIA MISNER 


Reporting Services gets a major overhaul in SQL 
Server 2008. In this session, we will focus on how to 
use the new report development features that solve 
challenging report layout problems. We will also 
show how to use the new charting and gauge capa- 
bilities to properly display quantitative data. 


SBI304: FOLLOW THE RABBIT: 
INTERACTIVE Q&A ON ANALYSIS 
SERVICES PERFORMANCE 
STACIA MISNER 


The focus of this discussion is how to adjust data- 
base design and server tuning to help you get bet- 
ter performance from your Analysis Services solu- 
tion. There will be only 5-10 slides covering best 
practices to get the conversation started, but most 
of the session time is open for your questions. Come 
participate in this session for an informative and 
interactive experience that will give you practical 
advice to put into practice in your own environment. 


SSWUG 


SSWUGO1: STARTING A NEW DBA POSITION 
CHRIS SHAW 


Starting a new job can be one of the most stressful 
times of your life. This is magnified by the fact that 
the work you complete in the first 30 days may be 
used as an indicator of how well you are going to 
succeed at the company. Yet everyone has to start a 
new position at least once. So how do you make sure 
that you hit the ground running and you make that 
good impression? 


SSWUGO2: INTRODUCING THE SQL 
SERVER PERFORMANCE DATA COLLECTOR 
TED MALONE 


SQL Server Administrators have long asked for tools 
to make performance baselining and tuning easier. 
n SQL Server 2005, Microsoft introduced a series of 
system views and functions that took a huge step 
towards that, and in Service Pack 2, they introduced 
a series of add-on reports to help make the job eas- 
ier. In SQL Server 2008, Microsoft has introduced an 
entire framework to allow administrators to collect 
relevant and actionable performance information. 
his session will introduce the Performance Data 
Collector, and show how it can be used to assist 
DBAs and developers alike. 


SSWUGO3: ANALYSIS SERVICES DEEP DIVE 
TED MALONE 


This session presents SQL Server Analysis services 
best practices and advanced techniques from both 
the developer and administrator perspectives. This 
session focuses on the real-world application of best 
practices for infrastructure and security design, 
cube design, aggregation design, performance tun- 
ing and solution deployment. During the session, 
attendees will gain insight into a real-world Business 
Intelligence product development and deployment. 


SSWUGO4: SOX LESSONS LEARNED 
CHRIS SHAW 


Let me share with you the lessons that | have 
learned by going through one of the most in-depth 
audits | have ever done. We will cover why these 
audits are being done and what we need to do to 
avoid being on the hit list. | will spend time talking 
about the auditors and what they are looking for. | 
will share with you the ways that | have been able to 
mitigate major discrepancies that they have found 
and why they worked the way they did. | can show 
you the documents that they need and the docu- 
ments gotcha's that you should avoid. Even if you 
are not a publicly held company, | think it's only a 
matter of time before non-profits start to have to 
answer to the same audit standards. 


SSWUGO5: INTEGRATION SERVICES 
DEVELOPMENT BEST PRACTICES 
MATTHEW ROCHE 

Are you tired of feeling like you're making the same 
mistakes over and over again? Would you like to have 
a roadmap that outlines the pitfalls you're likely to 
encounter when building ETL solutions with SSIS? 
Then this session is for you! You'll learn how to get 
the most from the SSIS tools and platform through a 
set of SSIS development best practices from a battle- 
scarred database and BI consultant who has sur- 
vived the rough projects and lived to tell the tale. 


SSWUGO6: SQL SERVER INTEGRATION 
SERVICES PERFORMANCE TUNING AND 
OPTIMIZATION 

MATTHEW ROCHE 


SSIS packages have many capabilities, from control 
flow to event handlers to scripting. But the SSIS data 
flow is where the decisions you make will have the 
greatest impact on the performance of your pack- 
ages. In this session, you'll learn what's going on 
under the hood in the SSIS data flow pipeline, and 
how to take advantage of that knowledge to make 
your packages perform better. You'll also learn gen- 
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eral tips and tricks to improve SSIS package per- 
formance and how to get the most out of your pack- 
ages. 


SSWUGO7: PLAN GUIDES 
JEREMY LOWELL 


In SQL Server 2008, there exists a way to change or 
create a new plan for a query without touching the 
query itself.In environments where the database was 
provided by a third-party or in a high availability 
environment, these will help to decrease your impact 
on production systems while improving performance 
at the same time. Over the course of the session, we 
will walk through the creation of a plan guide. 


SSWUGO8: AGILE BUSINESS 
INTELLIGENCE 
TED MALONE 


Many business intelligence projects are grown from 
in-house requirements and take on a life of their 
own. Many times when developers think about В! 
projects, they tend to classify them as non-standard 
development projects and don't apply typical engi- 
neering practices. This session will discuss how Agile 
methodologies are a perfect fit for Business 
Intelligence projects. This session uses a case-study 
approach, modeling a successful real-world ВІ proj- 
ect that was built using Agile methodologies. 


SSWUGO9: POLICY MANAGEMENT 
JEREMY LOWELL 


Developers and DBAs alike can now enforce naming 
conventions at the database with policy-based man- 
agement. There are many other types of policies that 
can be created and evaluated in SQL Server 
2008. This new management tool will allow the devel- 
oper and DBA to spend less time checking on things 
and more time creating and managing them at an 
environment level versus a database by database 
level. This session will show examples of how to cre- 
ate and evaluate policies. 
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NOT JUST SPREADSHEETS: MICROSOFT 
OFFICE AND EXCEL AS A BUSINESS 
INTELLIGENCE DEVELOPMENT PLATFORM 
MICROSOFT 


INTRODUCTION TO SHAREPOINT 
DEVELOPMENT WITH MICROSOFT VISUAL 
STUDIO 2008 

MICROSOFT 


DEVELOPING YOUR FIRST OFFICE BUSINESS 
APPLICATION: FROM THE CLIENT TO 
SHAREPOINT AND BEYOND 

MICROSOFT 


BUSINESS INTELLIGENCE AND 
SHAREPOINT 


HBI101: 10 THINGS YOU NEED TO KNOW 
ABOUT PERFORMANCEPOINT SERVER 
MAURO CARDARELLI 

PerformancePoint Server is the perfect tool for 
extending your SharePoint investment into hard- 
core Business Intelligence. Since PerformancePoint 
is so big and powerful, how do you get started with 
it in your organization? This session will help you 
jump with both feet in by walking you through the 
top 10 things you need to know about PPS and how 
each can add business value to your company. This 
session will include several demonstrations and 
offers the SharePoint gurus a great first step into 
the world of ВІ. 


HBI301: CONNECTING TO YOUR ORACLE 
DATA WITH SHAREPOINT 

MAURO CARDARELLI 

Has your organization invested in MOSS for a cor- 
porate intranet but still has business critical data 
in Oracle? This session will show you how to bridge 
the two by bringing in your Oracle data into 
SharePoint. Whether it is simple reporting or more 
complicated bi-directional updates, this session 
will get the SharePoint and Oracle folks in your 
company working together to build the next wave 
of powerful business user tools. Numerous demon- 
strations and code snippets will be shared. 


HBI302: BUILDING A SALES PIPELINE 
APPLICATION WITH REPORT CENTER 
MAURO CARDARELLI 

Have you started using Report Center? Did you 
know that you can use it for things like managing 
a sales pipeline or broker services? This session 
will walk you through building both. It will also 
include steps for creating basic reports and KPls 
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LIGHT UP YOUR SHAREPOINT WEB SITE 
WITH MICROSOFT SILVERLIGHT AND AJAX 
MICROSOFT 


SHAREPOINT AND ECM: EMPOWERING YOUR 
USERS WHILE MAINTAINING INFORMATION 
GOVERNANCE AND COMPLIANCE 
MICROSOFT 


MASHING-UP THE WEB: SHAREPOINT AND 
SHAREPOINT DESIGNER 
MICROSOFT 


using Report Center data. Whether you are new to 
Report Center or a seasoned pro, this session will 
allow you to walk away with action items for tak- 
ing your implementation to the next level. 


DEPLOYMENT, ADMINISTRATION, 
OPERATION, AND OPTIMIZATION 
OF SHAREPOINT 


HBI303: INTRODUCTION TO MOSS 
ADMINISTRATION 

MICHAEL BLUMENTHAL 

Learn the ABCs of MOSS administration: 
Application Pools, Backups, Central Administration, 
and an alphabet soup of other concepts including 
the five S's of SharePoint: Sites, Site Collections, 
Service Accounts, Server Monitoring, and STSADM, 
as well as Web Applications, Features and 
Solutions, Policies, Quotas, and more! Oh my! 


HBI304: SITE PROVISIONING SOLUTIONS 
MICHAEL BLUMENTHAL 

Are you stuck between not allowing any users to 
create sites and not wanting to burden IT with cre- 
ating them? This session is for you! | | discuss a 
middle ground where the user submits a request 
for a site, approvals are requested, and if all the 
approvals are met, the request is created. 


FORMS AND WORKFLOW 
WITH SHAREPOINT 


HFW301: CONVERT YOUR EXISTING WORD 
AND EXCEL FORMS TO INFOPATH 

ASIF REHMANI 

If your corporation is like most, you probably 
have at least a dozen or so electronic forms that 
have been created using Microsoft Office Word or 
Microsoft Office Excel. Wouldn't it be nice to be 
able to publish these forms directly to your 
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SharePoint site and have your users fill them out 
using their browsers? In this session, you will 
first see how you can convert your existing Word 
or Excel forms to InfoPath forms. Then you'll 
learn how to implement information from exter- 
nal data sources and client-side validation logic 
in these forms. Lastly, you'll see how to publish 
the forms onto a SharePoint site for easy access 
by the users. 


HFW302: DESIGN POWERFUL WORKFLOWS 
WITH SHAREPOINT DESIGNER 

ASIF REHMANI 

Workflows are essential to business processes. 
The ability to route a document or an item 
through multiple stages, gathering feedback and 
approval from multiple people, is a routine proce- 
dure in many companies. Now you can take advan- 
tage of SharePoint's built-in support for workflows 
to design powerful multi-step workflows. In this 
session, you will first see how to make use of the 
built-in workflows directly through SharePoint's 
browser interface. Then, you will see how to devel- 
op multi-step powerful workflows-without code- 
using Microsoft Office SharePoint Designer 2007. 


HFW303: SIGN YOUR INFOPATH 
ELECTRONIC FORMS USING DIGITAL 
SIGNATURES AND PUBLISH 

TO FORMS SERVER 

ASIF REHMANI 

So you think that you can go completely digital 
and eliminate paper altogether from your organi- 
zation? Where would you store these forms? How 
about collecting signatures from people? No wor- 
ries! In this session, you will see how you can dig- 
itally sign your Microsoft Office Infopath 2007 
electronic forms and then publish them to the 
Forms Server in SharePoint to be served up to 
your users through any browser. 


HFW304: ADDING CODELESS WORKFLOWS 


TO INFOPATH FORM SOLUTIONS 
DAVID GERHARDT 


Deploying a Microsoft Office InfoPath 2007 form 
template to SharePoint provides users with a reli- 
able mechanism for collecting data. When work- 
flow is added to the mix, users get a robust solu- 
tion that automates common tasks. This session 
identifies how you can integrate codeless work- 
flows into SharePoint form solutions. 


HFW305: DEVELOPING INFOPATH BROWSER 
FORMS FOR SHAREPOINT 

DAVID GERHARDT 

The Microsoft Office InfoPath 2007 “design once" 
concept allows you to use a single form template 
in rich client and browser scenarios. However, 
there is a reduced set of InfoPath functionality 
available for browser forms. This session explains 
some of the limitations in working with InfoPath 
Forms Services and discusses ways to optimize 
browser forms using out-of-the-box and managed- 
code techniques. 


HFW306: DEVELOPING INFOPATH CLIENT- 
ONLY FORMS FOR SHAREPOINT 

DAVID GERHARDT 

Microsoft Office InfoPath 2007 has a powerful 
designer that offers a wide selection of controls 
and features for SharePoint form solutions. It also 
includes an event framework that provides devel- 
opers options to extend form functionality with 
managed code. This session explores many of the 
InfoPath designer capabilities and examines ways 
to address challenges that arise when deploying a 
form template to SharePoint. 


MOSS SEARCH 


HSE301: CUSTOMIZING SEARCH CENTERS 
TO SUPPORT SEARCH SERVER 2008 
DANIEL WEBSTER 
Ensuring that the search result set has high rele- 
vance is always the focus of search design. This ses- 
sion covers the design topics inherent in developing 
a robust, customized search center topology. This 
session will focus on creating a customized MSS 
end-user experience that enhances the presenta- 
tion and findability of content items in a result set 
without writing code. Like Microsoft Office 
SharePoint Server 2007, MSS effectively divides 
management tasks among farm administrators, 
search administrators, and site collection adminis- 
trators. The abilities of site collection administrators 
to customize search for their users may change 
your search topology design. Considerations in your 
design will include: 

= Activate MSS for the site collection 

m Create Search Centers for site collections 


m Search scopes for the site collection 
m Keywords and best bets for the site collection 
= Indexing of sites, lists, and libraries 
We will review design considerations for: 
m Centralized Search Center 
= Decentralized Search Center 
m Hybrid design 


HSE102: HOW MICROSOFT SEARCH SERVER 
2008 EXPANDED SEARCH IN SHAREPOINT 
SERVER 2007 

DANIEL WEBSTER 

Microsoft introduced a new search product that 
you can use to upgrade and enhance the search 
capabilities of Windows SharePoint Service 3.0. 
Microsoft released a licensed version of this prod- 
uct, Microsoft Search Server 2008 (MSS), in early 
2008, along with a free version, Microsoft Search 
Server 2008 Express (MSSX). You can integrate 
this exciting new product into SharePoint Server 
2007 for free. It extends the search capabilities 
and will cause you to re-think your search topolo- 
ду design. 


HSE303: USING FEDERATED LOCATION 
DEFINITIONS AND LEVERAGING LIVE.COM 
WITH SEARCH 
DANIEL WEBSTER 
Microsoft Search Server 2008 introduced federat- 
ed queries that provide the ability to forward 
user's search terms to external search engines 
concurrently with queries to its own index. The 
topics in this dicussion include: 

m OpenSearch 1.1 

m Federated queries 

= FLD files 

= Installation of available FLDs 

= Create custom FLD files 
Microsoft has implemented a solid integration 
between Live.com and MSS. This session will demon- 
strate how to extend the federated query technolo- 
gy with Live.com to create a customized, highly rel- 
evant result set for your end users. The topics we'll 
discuss include: 

= Create Live.com accounts 

= Live.com basic and advanced features 

= Build federated queries that integrate with 

Live.com 


HSE304: MOSS SEARCH: IMPROVING 
RELEVANCE AND THE SEARCH EXPERIENCE 
USING THE API 

ERIK MAU 

The out-of-the-box search experience can be 
improved by tapping into the API available for 
search. This session takes an in-depth look at the 
search API and introduces patterns that can be 


applied to strengthen both the search user expe- 
rience and search relevance. 


HSE305: MOSS SEARCH: LEVERAGING YOUR 
INVESTMENT IN THE PLATFORM 

ERIK MAU 

SharePoint 2007 provides a powerful search 
engine that is often underutilized. This session 
provides a deep dive into the out-of-the-box capa- 
bilities the platform provides. It will cover both 
search administration and user experience prac- 
tices that will help you unlock the full potential of 
the platform. 


HSE306: MOSS SEARCH: UNIFYING 
BUSINESS DATA AND DOCUMENTS 

ERIK MAU 

SharePoint's Business Data Catalog feature pro- 
vides the ability to integrate your business data 
into the SharePoint platform. By leveraging the 
SharePoint search framework, we can provide 
another entry point into your user's business 
applications. This session focuses on how to incor- 
porate the out-of-the-box capabilities with cus- 
tomizations that use the search API. 


SHAREPOINT ARCHITECTURE AND 
CAPACITY PLANNING 


HAR201: INFORMATION ARCHITECTURE 

FOR A MOSS INTRANET 

MICHAEL BLUMENTHAL 

So you've built your MOSS server farm. You now 
have all the infrastructure in place to host your 
first user-facing site, and it's the corporate 
intranet. What now? In this session, | | provide 
guidance through the critical decision of picking 
your root site template, Web application naming 
issues, site collection planning, and how to organ- 
ize the content on your site. Learn how to inte- 
grate needs for publishing, collaboration, and 
applications into one consistent user experience. 


HAR302: ARCHITECTING A HIGHLY 
REDUNDANT SHAREPOINT 2007 FARM 
MICHAEL NOEL 

This session covers best practice design and plan- 
ning for setting up a highly redundant Office 
SharePoint Server 2007 environment. Technologies 
such as Network Load Balancing of SharePoint Web 
front ends and Application servers are covered in 
depth, and tips and tricks for the deployment of 
redundancy in a SharePoint environment are pre- 
sented. In addition, specifics on how to leverage 
SQL Server 2005 Database Mirroring technologies 
for remote data replication of SharePoint content 
databases is covered and demonstrated in detail. 
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HAR303: BUILDING THE PERFECT 
SHAREPOINT FARM: A WALKTHROUGH OF 
BEST PRACTICES FROM THE FIELD 
MICHAEL NOEL 

SharePoint 2007 has proven to be a technology 
that is remarkably easy to get running out of the 
box. On the flipside, however, some of the 
advanced configuration options with SharePoint 
are notoriously difficult to set up and configure, 
and a great deal of confusion exists regarding 
SharePoint best practice design, deployment, dis- 
aster recovery, and maintenance. This session cov- 
ers best practices encompassing the most com- 
monly asked questions regarding SharePoint 
infrastructure and design, and includes a broad 
range of critical but often overlooked items to 
consider when architecting a SharePoint environ- 
ment. In short, all of the specifics required to build 
the "perfect" SharePoint farm are presented 
through discussion of real-world SharePoint 
designs of all sizes. 


HAR304: VIRTUALIZING SHAREPOINT 
COMPONENTS 

MICHAEL NOEL 

Server virtualization technologies have taken front 
stage recently and many organizations have begun 
to seriously contemplate replacing physical 
servers, including SharePoint servers, with virtual- 
ization technologies. This session focuses on real- 
world architecture and best-practice recommenda- 
tions for incorporating SharePoint architecture 
into virtualized environments running with either 
Microsoft's Virtual Server 2005, Microsoft's 
Windows 2008 Hyper-V Virtualization, EMC's 
VMware Server, and Citrix XenApp products. The 
session focuses on outlining which specific com- 
ponents of SharePoint operate well in a virtualized 
environment versus which ones are not necessari- 
ly good candidates. In addition, this session gives 
an in-depth look at real-world designs for 
SharePoint using both major virtualization prod- 
ucts and outlining the strengths and weaknesses 
of each product in relation to SharePoint function- 
ality and supportability. 


SHAREPOINT CUSTOMIZATION 


HCS301: AUTOMATING COMMON 
SHAREPOINT TASKS WITH POWERSHELL 
NEIL IVERSEN 

Why let the Exchange administrators have all the 
fun? PowerShell can provide a scriptable interface 
into SharePoint to get rid of repetitive tasks or 
troubleshoot a problem. You can use PowerShell to 
simplify common tasks like deployments, manage 
site permissions or even move documents in a doc- 
ument library. You can even move a site from a 
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development to production without touching the 
Ul. This session will help you effectively use 
PowerShell in your own environment. 


HCS202: EFFECTIVELY USING FEATURES 
AND SOLUTIONS 

NEIL IVERSEN 

Features provide the foundation for adding your 
customizations to SharePoint. Solutions form the 
package that can contain many features and files 
to deploy to your entire farm. Together, these two 
provide the ability for you to create redeployable 
packages for SharePoint, but you need to know 
how to use them effectively. Learn how to use them 
to manage change between your different environ- 
ments and promote code and changes through a 
change control process. 


HCS403: PACKAGING YOUR ADVANCED 
SHAREPOINT CUSTOMIZATIONS 

NEIL IVERSEN 

SharePoint provides a set of features for com- 
mon customizations, but what about ones that 
are left out? How can you deploy a custom field, 
policy or event handler? What if you have content 
that isn't even stored in SharePoint? Is it possi- 
ble to deploy a SQL database? Should you? 
Answering these questions requires a detailed 
look at how customizations are deployed and 
their lifecycle and dependencies. This session 
will enable you to create packages to solve your 
deployment problems. 


HCS304: INTEGRATING ACCESS 

AND SHAREPOINT 

TY ANDERSON 

This session will drill into the development fea- 
tures of Microsoft Access as they relate to 
SharePoint. Several demos will be provided show- 
ing how to utilize Microsoft Access as the front-end 
to SharePoint-based solutions. Coverage of man- 
aged Microsoft Access applications using VSTO will 
be included as well. At the end of this session, the 
attendee will understand how to integrate 
Microsoft Access with SharePoint to deliver solu- 
tions within their organization. 


HCS305: OUTLOOK AND SHAREPOINT 
USING LINQ TO SQL 

TY ANDERSON 

Learn how to integrate SharePoint data with 
Outlook using LINQ to SharePoint. | will explain 
how to utilize VSTO and LINQ to build an Outlook 
add-in that automates a common business 
process. Attendees will gain an understanding of 
how to work with SharePoint sites and lists using 
LINQ as well as how to incorporate the data with- 
in the Outlook data store. 


HCS306: BUILDING CUSTOM WORKFLOWS 
WITH VSTO 

TY ANDERSON 

Learn how to build and publish custom SharePoint 
workflows using Visual Studio Tools for Office 
(VSTO). You'll see how to build both sequential and 
state- machine workflows as well as how to deploy 
them to SharePoint. 


SHAREPOINT FOR ENTERPRISE 
CONTENT MANAGEMENT 


HEC301: CREATING AND DEPLOYING A 
CUSTOM DOCUMENT CONVERTER FOR 
SHAREPOINT 2007 

PAUL STORK 

A lot of the information used in organizations 
today is stored as unstructured content in files 
like Word documents. In this session, we will exam- 
ine one of the external services available in 
SharePoint's Web Content Management Features 
that can make this unstructured content more 
easily accessible. Using the Document Converter 
services in SharePoint we can take existing con- 
tent stored in Word 2007 documents and automat- 
ically convert it to a published Web page. But what 
if your content is stored in older Word documents 
or some other unsupported format? We will also 
look at how document conversion can be extend- 
ed by creating and deploying new Document 
Converters that expand on the four formats that 
ship with SharePoint. 


HEC302: DEPLOYING CONTENT AND 
INFRASTRUCTURE IN A WEB CONTENT 
MANAGEMENT ENVIRONMENT 

PAUL STORK 

Content Deployment in Microsoft Office SharePoint 
Server can be used to automate the deployment of 
content in a Web Content Management environ- 
ment.It can be used to develop an automated two- ог 
even a three-tier environment where content is auto- 
matically deployed through the various tiers even if 
they are on different SharePoint Farms. But that's 
only half of the story. Because Content Deployment 
only moves content stored in the SharePoint data- 
bases. What about infrastructure components like 
Web Parts, content types, list templates, and even 
custom master pages? You can use a combination of 
Features and Solutions to automate deployment of 
infrastructure components. This session will discuss 
how to develop a parallel environment where con- 
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tent is moved from an authoring site to a staging site 
and then to production using Content Deployment. 
Meanwhile, infrastructure components are deployed 
using a combination of Features and Solutions from 
a development environment where they are created, 
a testing environment where they are tested, to a 
production environment. This combination of 
Content Deployment, Features, and Solutions will 
provide you with a truly manageable enterprise 
SharePoint installation. 


HEC303: DEVELOPING PRINTABLE 


SHAREPOINT PAGE LAYOUTS USING CSS 
PAUL STORK 

Have your users ever complained because your 
SharePoint sites aren't easily printable? In this ses- 
sion, we'll examine how to build a custom 
Cascading Style Sheet that will be used by pages in 
your site as a printing layout. By designating a 
printing style sheet in your master page you can 
define a printable view of your pages that strip out 
elements that aren't necessary when printing a 
page, like navigation bars, menu buttons, and 
search boxes. You can also define a layout that is 
sized appropriately for a printed page. Learn how 
to make SharePoint more than just an online col- 
laborative environment by making it easy to take a 
quick hard copy snapshot of a site. 


HEC204: FIVE SHAREPOINT FRIENDS IN AN 
ECM WORLD 

SAHIL MALIK 

In this session, Sahil walks you through five 
SharePoint concepts that you will apply repeatedly 
in any ECM delivery. Using these five concepts, 


Sahil will demonstrate Records Management, 
Records Policies, Offline storage, metadata moni- 
toring, and effective search. 


HEC305: LARGE OBJECT STORAGE 

IN SHAREPOINT 

SAHIL MALIK 

Y'all, this is a big problem! How exactly can you 
store large documents inside of SharePoint? By 
large, | mean over 26B in size. And when you do 
store such documents, how exactly does it work 
inside the SharePoint Ul, how does search work, 
and how does SharePoint work? The beauty of any 
Microsoft platform is its extensibility. | guess it's 
beauty we will have to rely on after all. 


HEC306: WHY CONTENT TYPES 

ARE YOUR FRIEND 

SAHIL MALIK 

Content types, we have all used them, we all like 
them. But they take a special meaning in an ECM 
project delivery. In this session, Sahil points out 
some very practical situations where content types 
present an elegant solution to common ECM tasks. 


SHAREPOINT SECURITY 
AND EXTRANETS 


HSC301: SHAREPOINT PERSONALITY 
DISORDER: FBA, ADFS, LIVEID, NTLM, 
KERBEROS, LDAP... EGAD... 

ROBERT GINSBURG 

Microsoft and other third parties have provided a 
myriad of authentication tools for authenticating 
users in SharePoint. However many of the differ- 


ent authentication schemes have corresponding 
authorization requirements? This session will 
focus on discussing the differences between a 
number of SharePoint authentication techniques 
and how to reconcile SharePoint content authori- 
zation, audit trails, and workflow roles, and the 
impact on archival content storage. 


HSC302: YOU WANT TO TRUST WHO? 
FEDERATED EXTRANET SCENARIOS AND 
SHAREPOINT 

(A CASE FOR AN IDENTITY FIREWALL) 
ROBERT GINSBURG 

Microsoft ADFS (and several other technologies) 
have come long way. However, many of them oper- 
ate against transient identities whose identity 
other SharePoint users cannot always understand. 
In this session, we will discuss the need for per- 
sistent identities, how to accomplish them, and 
how to protect your internal network resources 
with an "Identity firewall". 


HSC303: SHAREPOINT AND ILM-ACCOUNT 
MANAGEMENT ON STEROIDS 

ROBERT GINSBURG 

Out of the box, SharePoint provides a simple col- 
laborative system for creating Active Directory 
accounts. However these accounts are only creat- 
ed, not managed. When the business scenarios 
change, no one knows who still needs access and 
who should be restricted. In this session, we will 
discuss how to connect SharePoint and Microsoft 
Identity Lifecycle manager to provision Extranet 
access Active Directory accounts and link them to 
the life of content in the SharePoint store. 


Tuesday, November 11, 2008 11:45 AM Lunch * Sign up at Conference Registration Desk 
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Attend Microsoft ASP.NET, Visual Studio & „МЕТ, or 
SharePoint Connections and attend the sessions of 
the SQL Server Connections conference for FREE! 


A LAP AROUND THE NEWEST FEATURES IN 
ASP.NET AND MICROSOFT VISUAL STUDIO 
2008 FOR WEB DEVELOPERS 
MICROSOFT 


AJAX PROGRAMMING WITH THE MICROSOFT 
AJAX TECHNOLOGIES 
MICROSOFT 


ADO.NET DATA SERVICES FOR THE WEB 
(A.K.A. PROJECT "ASTORIA") 
MICROSOFT 


ASP.NET SERVER CONTROLS FOR MICROSOFT 
SILVERLIGHT, AJAX CONTROL TOOLKIT, AND 
ASP.NET AJAX 
MICROSOFT 


DEVELOPING WEB 2.0 WITH MICROSOFT 
INTERNET EXPLORER TOOLS 
MICROSOFT 


CAR OR MOTORCYCLE: CHOOSING THE ASP.NET 


MVC FRAMEWORK 
MICROSOFT 


BUILDING GREAT WEB APPLICATIONS WITH 
MICROSOFT SILVERLIGHT 
MICROSOFT 


BUILDING MICROSOFT SILVERLIGHT MEDIA 
APPLICATIONS 
MICROSOFT 


SESSIONS AND SPEAKERS 
ARE SUBJECT TO CHANGE. 


|| MICROSOFT ASP.NET SESSIONS 


AAR201: FROM ZERO TO N-TIER 
IN 75 MINUTES 
PAUL SHERIFF 


AAR202: BUILDING N-TIER ASP.NET 
APPLICATIONS WITH LINQ 
DAN WAHLIN 


ACS201: AJAXIFYING EXISTING APPS WITH 
THE ASP.NET AJAX EXTENSIONS 
PAUL LITWIN 


ACS202: USING JQUERY WITH ASP.NET 
RICK STRAHL 


ACS203: USING WCF FOR JSON AND REST 
SERVICES WITH ASP.NET 
RICK STRAHL 


ACS304: AJAX: CHALLENGES 
AND SOLUTIONS 
CHRISTIAN WENZ 


ለር5405: ООР WITH/DESPITE JAVASCRIPT 
CHRISTIAN WENZ 


ADX201: DATA CONTROL TIPS AND TRICKS 
PAUL LITWIN 


ADX202: VIEWING DATA IN THE NEW 
DATA WORLD 
DAVE SUSSMAN 


ADX103: .NET 3.5 LANGUAGE FEATURES 
DAVE SUSSMAN 


AFR201: BUILDING SMART TAGS AND 
PROPERTY BUILDERS FOR YOUR WINDOWS OR 
WEB CONTROLS 

MIGUEL CASTRO 


AGN401: DEMYSTIFYING URL REWRITING AND 
HTTP MODULES AND HANDLERS 
MIGUEL CASTRO 
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AGN202: ASP.NET SEARCH ENGINE 
OPTIMIZATION TIPS AND TRICKS 
ROBERT HOWARD 


AGN303: ASP.NET INTERNALS 
ROBERT HOWARD 


AGN104: CSS 101-BUILDING SITES 
WITHOUT TABLES 
DAVE SUSSMAN 


AMV201: INTRO TO THE ASP.NET 
MVC FRAMEWORK 
MARKUS EGGER 


AMV202: UNDERSTANDING GOALS AND 
FEATURES OF THE ASP.NET MVC FRAMEWORK 
DINO ESPOSITO 


APF301: FROM ONE SERVER TO TWO: 
MAKING THE LEAP 
RICHARD CAMPBELL 


APF302: ASP.NET PERFORMANCE AND SCALE 
TIPS AND TRICKS 
ROBERT HOWARD 


APF203: DEALING WITH LONG RUNNING 
REQUESTS IN ASP.NET 
RICK STRAHL 


ARP201: PROGRAMMING SQL SERVER 
REPORTING SERVICES 
PAUL LITWIN 


ASC401: AJAX AND WEB 
APPLICATION SECURITY 
CHRISTIAN WENZ 


ASL201: SILVERLIGHT WEB CONTROLS... 
YES YOU HEARD RIGHT 
MIGUEL CASTRO 


ASL302: SILVERLIGHT DEVELOPMENT 
FOR MOBILE DEVICES 
NICKOLAS LANDRY 


ASL203: SILVERLIGHT 2.0 ANNOTATION USING 
THE INKPRESENTER CONTROL 
JULIE LERMAN 


ASL204: DATA ACCESS WITH SILVERLIGHT 2 
JOHN PAPA 


ASL305: LEVERAGE SILVERLIGHT 2 
ANIMATION FEATURES 
DAN WAHLIN 


САЦ 


CONNECTIONS 


„МЕТ • MICROSOFT D 


HOW TO REVIEW YOUR CODE AND TEST FOR 
SECURITY BUGS 
MICROSOFT 


INTRODUCTION TO MICROSOFT 


SYNC FRAMEWORK 
MICROSOFT 


LINQ TO XML, SQL, ENTITIES, 
DATASETS, AND CO: DATA ACCESS 
TECHNOLOGIES EXPLAINED 
MICROSOFT 


MICROSOFT VISUAL STUDIO DEVELOPMENT FOR 


MICROSOFT SQL SERVER 2008 
MICROSOFT 


TIPS, TRICKS, AND TECHNIQUES FOR 
BUILDING A GREAT MICROSOFT 
SILVERLIGHT APPLICATION 
MICROSOFT 


ENTITY FRAMEWORK IN A WORLD OF 
SERVICES AND PROCESSES 
MICROSOFT 


WHAT IS THE CONTEXT OF THIS CONVERSATION? 
ENABLING LONG RUNNING CONVERSATIONS IN 
WORKFLOW SERVICES 

MICROSOFT 


DEEP DIVE: ADO.NET DATA SERVICES 
FRAMEWORK - APPLICATION PATTERNS 
MICROSOFT 


ENTITY FRAMEWORK FUTURES 
MICROSOFT 


VISUAL STUDIO & .NET SESSIONS || 


VSM301: ADVANCED TECHNIQUES FOR 
DEPLOYING SMART CLIENT APPLICATIONS 
CATHI GERO 


VCL301: SELECTING THE RIGHT 
CLIENT TECHNOLOGY 
BRIAN NOYES 


VDA309: ANYWHERE ACCESS TO DATA 
MARKUS EGGER 


VDA303: DEEP DIVE INTO ENTITY 
FRAMEWORK OBJECT SERVICES 
JULIE LERMAN 


VVDA307: HOW SYNCHRONIZATION SERVICES 
FOR ADO.NET WORKS 
CATHI GERO 


VDA308: INSIDE BINDING AND MAPPING IN 
LINQ TO SQL 
DINO ESPOSITO 


VDA304: INTEGRATING ENTERPRISE 
LIBRARY'S DATA ACCESS APPLICATION 
BLOCK WITH YOUR PROJECT 

JOHN PAPA 


VDA306: LEVERAGING THE SQL SERVER 
COMPACT EDITION IN AN OCCASIONALLY 
CONNECTED SYSTEM 

WILLIAM R. VAUGHN 


VDA302: MAKING ADO.NET DATA ACCESS 
BREATHTAKINGLY FAST 
DON KIELY 


VDA301: NEW .NET FEATURES FOR 
THE ORACLE DATABASE 
CHRISTIAN SHAY 


VDA305: PRACTICAL STRATEGIES WITH 
THE ENTITY FRAMEWORK 
JOHN PAPA 


VCA300: BUILD A SECURITY SYSTEM FOR 
WINDOWS FORMS USING CLIENT 
APPLICATION SERVICES 

PAUL SHERIFF 


VFR301: .NET WINDOWS FORMS TIPS 
AND TRICKS 
CATHI GERO 


VPA303: PLINQ: LINQ, BUT FASTER! 
STEPHEN TOUB 


VLV200: .NET ROCKS LIVE! 
CARL FRANKLIN & 
RICHARD CAMPBELL 


VPN300: HOW TO RUN 

A CONSULTING BUSINESS 

RICHARD CAMPBELL, ROB HOWARD, 
JUVAL LOWY, PAUL SHERIFF 


VPC301: CODE GENERATION IN 2008 
KATHLEEN DOLLARD 


VPC302: NEW LANGUAGE FEATURES 
IN .NET 3.5 
KATHLEEN DOLLARD 


VPL301: GETTING YOUR TECHNICAL 
WORK PUBLISHED 
WILLIAM R. VAUGHN 


VPA302: PARALLELIZING WCF SERVICES WITH 
VISUAL STUDIO 2008 AND WINDOWS HPC 
SERVER 2008 

NICKOLAS LANDRY 


VTS300: BUILD TEST CASES THE EASY WAY 
WITH VISUAL STUDIO 2008 
PAUL SHERIFF 


VPA301: CONCURRENCY WITH THE TASK 
PARALLEL LIBRARY 
STEPHEN TOUB 


VUI301: THE SCIENCE OF GREAT UI 
MARK MILLER 


VWC303: BUILDING A WCF ROUTER 
FOR YOUR APPLICATIONS 
MICHELE LEROUX BUSTAMANTE 


VWC302: DURABLE WCF SERVICES 
JUVAL LOWY 


VWC304: LOAD BALANCING AND SCALING 
YOUR WCF SERVICES 
MICHELE LEROUX BUSTAMANTE 


VWC301: PRODUCTIVE WCF 
JUVAL LOWY 


VWC305: PRACTICAL SCENARIOS 
FOR FEDERATED SECURITY 
MICHELE LEROUX BUSTAMANTE 


VCF301: ROGUE AND ROUGH: HERE'S 
THE WORLD OF WEB FOR WCF SERVICES 
DINO ESPOSITO 


VWP301: BUILDING DATA VISUALIZATION 
APPLICATIONS WITH THE WINDOWS 
PRESENTATION FOUNDATION (WPF) 
TIM HUCKABY 


VWP303: DESIGNING POLISHED WPF 
INTERFACES WITH EXPRESSION BLEND 
MARKUS EGGER 


VWF301: DEVELOPING SERVICE-ORIENTED 
WORKFLOWS 
BRIAN NOYES 


VWP302: INTEGRATING WPF AND WCF INTO 
YOUR OFFICE BUSINESS APPLICATIONS 
TIM HUCKABY 


VWP304: LEVERAGE AND EXTEND WPF 
ROUTED COMMANDS AND EVENTS 
BRIAN NOYES 


VWP306: „МЕТ 3.5 SP1 FOR WPF DEVELOPERS 
SCOTT STANFIELD 


VWP305: USER CONTROLS AND CUSTOM 
CONTROLS IN WPF 
DINO ESPOSITO 
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DOTNETNUKE 
OPENFORCE 08 


CONNECTIONS 


OPENFORCE SESSIONS 


Connections introduces you to a co-located 
event in Las Vegas, NV at Mandalay Bay 
Resort & Casino, November 10-13, 2008 


DotNetNuke OpenForce ‘08 is a conference for DotNetNuke developers, 


administrators, and designers. This is a great event for DotNetNuke users 


to connect to some of the great things happening in the .NET Open Source 


community, all in a shared space in Las Vegas, November 10-13, at the 


Mandalay Bay Resort and Casino. DotNetNuke OpenForce ‘08 attendees can 


also attend other concurrently running Connections conferences sessions. 


DGN101: The DNN Approach to Community 
and Commerce 
NAVIN NAGIAH 


DGN102: Panel Discussion: 
DotNetNuke-The Road Ahead 

JOE BRINKMAN, NIK KALYANI, 
NAVIN NAGIAH, SCOTT WILLHITE, 
AND SHAUN WALKER 


SKIN DESIGN 


DSD101: Preparing a Skin Design and Creating 
an HTML-Based Skin Package 
TRACY WITTENKELLER 


DSD201: DotNetNuke and Web Standards 
CUONG DANG 


DSD301: Advanced Skinning with DotNetNuke 5.0 
THOMAS KRAAK 
VASILIS TERZOPOULOS 


MODULE DEVELOPMENT 


SITE ADMINISTRATION 


DMD101: Silverlight Development 
with DotNetNuke 
MICHAEL WASHINGTON 


DMD201: Developing Extensions for 
DotNetNuke 5 
CHARLES NURSE 


DMD202: Data Access in DNN DALs-Boldly 
Going where Some Modules Have Gone Before 
JIM BONNIE 


DMD301: Unit Testing with DotNetNuke 
JIM BONNIE 


DMD302: Leverage DotNetNuke 
with SharePoint 
STEFAN KAMPHUIS 


DMD303: Extending DotNetNuke Modules 
with AJAX 
KEVIN SCHREINER 


DMD304: Using Web 2.0 APIs 
in DotNetNuke Modules 
CATHAL CONNOLLY 


DOTNETNUKE OPENFORCE '08 
KEYNOTE ADDRESS 


SHAUN WALKER 
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DSA101: Selling DotNetNuke-Why Your Clients 
Should Be Using It 
CHRIS HAMMOND 


DSA102: Introduction to Search Engine 
Optimization with DotNetNuke 
THOMAS KRAAK 

DSA103: An Introduction to DotNetNuke 


5.0 Administration 
CHRIS HAMMOND 


DSA201: Building Secure 
DotNetNuke Installation 
CATHAL CONNOLLY 


DGN103: DotNetNuke: Real World, Real 
Problems, Real Solutions 
NIK KALYANI 


DMD305: Extending DotNetNuke with 
FaceBook and OpenSocial 
NIK KALYANI 


Please Visit: 
www.DevConnections.com/OpenForce 
for details and a full Tist of topics, 9 
abstracts, speakers and bios! 


IT Connections 


OCTOBER 6-8, 2008 
SAN FRANCISCO MARRIOTT 
SAN FRANCISCO, CA 


BRING YOUR TEAM 
TO IT CONNECTIONS 
IN SAN FRANCISCO 


Strategic and technical expertise 
to guide your technology decisions 
and implementations 


Imagine the opportunity for your IT 
leadership-executives and management- 
to develop strategic visions for your 
enterprise technology with the guidance of 
industry leaders...and for IT professionals to 
master the details of implementing those 
technologies through in-depth workshops led 
by renowned experts then to come together, 
as a team, joined by IT professionals and 
leadership from other business, academic, 
and governmental organizations... To learn... 
to discuss... to question... to solve... and 

to share. 


IT EVENT * OCT '08 | | 


(a> 
4 | ` 


Early Bird 
Registration Bonus: 
receive a FREE NIGHT 


at the San Francisco 
Marriott if you register 
by August 1" 2008 
(based on a 3-night 
minimum stay) 


This unique 3-day conference will feature 
strategic sessions to help your enterprise align 
important new technologies to support your 
near and long term requirements, including: 


= Virtualization 

= Cloud Computing 

= Automation and Consolidation 
m Green Computing 

m Unified Communications 

m Systems Management 

m Security 


And incredible technical workshops led 
by independent, nationally-recognized gurus 
featuring: 


m Windows Server 2008 and Windows Vista 

m Active Directory, Group Policy, and PowerShell 
m Exchange and Office Communications Server 
m SharePoint 

m System Center 

m Hyper-V and VMware 

m SQL Server 2008 


DEVELOPED BY TECHNET, MICROSOFT, PENTON MEDIA, AND HP 


THIS EVENT IS FOR: 
CIOS / CTOS • TECHNICAL DECISION MAKERS • BUSINESS DECISION MAKERS • IT MANAGERS • AND IT PROFESSIONALS 


Bring Your IT Team! 
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МОМЕМВЕК 9, 2008 


SQL SERVER 


PRE-CONFERENCE WORKSHOP • 9AM-4PM 


DATABASE BEST PRACTICES FOR THE INVOLUNTARY DBA 
KIMBERLY L. TRIPP & PAUL S. RANDAL 


Have you been nominated as "the SQL person" on your team? Are you a 
developer who's suddenly found their test database has become critical for 
your company's business? Have you become a DBA-even only accidentally- 
and do you find yourself managing SQL Server database(s) more and more? 
Are you sure your data is protected? Are you sure your applications can 
scale? The one thing you NEED now, to manage this system correctly-is 
knowledge! We'll cover all of the critical components related to configuring, 
implementing, and maintaining a SQL Server system. Topics will include an 
overview of SQL Server components, protecting and maintaining the data, 
writing effective server-side components (e.g., procedures and transactions), 
and many other items that all require server-side smarts. Come to this work- 
shop to find out the things you need to know to successfully manage SQL 
Server from the beginning-a day spent here will save you many more! 


NOVEMBER 10, 2008 


SQL SERVER 


PRE-CONFERENCE WORKSHOP • 9AM-4PM 

RELATIONAL DATA WAREHOUSING: LEVERAGING KEY FEATURES 
OF SQL SERVER 2005/2008 

KIMBERLY L. TRIPP & PAUL S. RANDAL 


Relational data warehouses are critical to every business, but building one 


using SQL Server has traditionally been difficult due to the lack of enabling fea- 


tures. SQL Server 2005 began addressing this with the addition of partitioned 
tables/indexes and online, piecemeal operations such as index maintenance, 
backups, and restores. SQL Server 2008 adds many significant new features to 


aid in relational data warehousing, such as data compression, change data cap- 


ture, and filtered indexes, plus some excellent additions and refinements to 
partitioning. In this information-packed session, we will examine how you can 
make the most of the key Database Engine features in SQL Server 2005 and 
2008 to implement and manage а mission-critical relational data warehouse. 
Attendees will also receive a copy of our very popular SQL Server 2008 VPC 
with over 24 hours of hands-on labs targeting many of these features. 


PRE-CONFERENCE WORKSHOP • 9AM-4PM 


QUERY TUNING IN MICROSOFT SQL SERVER 2005 AND 2008 
ITZIK BEN-GAN 


This workshop teaches you how to optimize problematic queries by tuning 
indexes and writing efficient code. The workshop covers in detail internal 
structures and index access methods, which are the fundamental building 
blocks that you need to be familiar with in order to master the art of query 
tuning and optimization. Once you get familiarized with those fundamental 
building blocks, you will learn how to put your knowledge into action by 
benchmarking various indexing options, analyzing their performance, and 
choosing the most ideal design. The workshop will also teach you how to tune 
and optimize your solutions by applying query revisions that can yield 
improvements in orders of magnitude in many cases. The workshop will also 
teach you how to get rid of cursors when set-based solutions are most appro- 


priate, and how to identify the uncommon cases where cursors are the last 
resort that will yield better performance than set-based solutions. 


If time permits, the seminar will also discuss compilations, recompilations, 
and reuse of execution plans. 


The seminar will cover new features in SQL Server 2008 where relevant. 


ASP.NET 


PRE-CONFERENCE WORKSHOP * 9AM-4PM 


SILVERLIGHT 2 DEVELOPMENT WORKSHOP 

DAN WAHLIN 

Silverlight 2 provides a powerful framework for building Rich Internet 
Applications (RIAs) that capture the attention of end users. In this workshop, 
Dan Wahlin will discuss Silverlight 2 technologies including language features, 
built-in controls, control templates, styles, networking features, animations, 
data binding, and more. Learn to build engaging applications using the latest 
client-side technology. 


VISUAL STUDIO 


PRE-CONFERENCE WORKSHOP • 9AM-4PM 
INTRODUCING WINDOWS WORKFLOW 
KATHLEEN DOLLARD 


You've heard about Windows Workflow Foundation, but no one has shown you 
how to implement it for your applications. This workshop starts with an 
overview of Windows Workflow and why it's likely to become a key part of 
your development strategy. You'll watch the creation of a complex sequential 
workflow integrating human and computer-based actions. Then you'll see a 
similar problem solved using a state machine workflow. Each workflow 
includes standard Windows Workflow tasks, custom tasks, and rule-based 
decisions. You'll see interactions with things outside the workflow-including 
notifying people and other systems, handling events, and implementing data 
exchange services. The workflow becomes more sophisticated as it integrates 
with the Visual Studio 2008 features that integrate with WCF. Workflows don't 
just handle process-they also manage the grimy details of robust systems- 
transactions and compensation, tracing, status reporting, and exception man- 
agement. You'll see how to implement these details and understand the role 
of the host in providing necessary services. Before closing we'll circle back 
around to designing workflows. Workflow design presents new challenges to 
developers, particularly since you can share this design experience with 
power end users-actually letting them design portions of the workflow. In 
this workshop, you'll learn how to recognize good workflow candidates, make 
decisions on workflow granularity, determine how to share design responsi- 
bilities, and implement workflow details. 


PRE-CONFERENCE WORKSHOP • 9AM-4PM 


ADO.NET ENTITY FRAMEWORK: 0-60 
JULIE LERMAN 


This full day workshop will provide you with a more complete overview on 
ADO.NET Entity Framework, Microsoft's new core data platform, than you could 
possibly get in an handful of introductory sessions. In this workshop, you will 
learn what the Entity Framework is and how it fits into your application and 
enterprise architecture. We will begin with an introduction to the Entity Data 
model, how to build it, and how to implement it in its simplest form. Then you 
Will learn how to take advantage of the true power of the Entity Data Model by 
creating customized mappings. You will learn how to query Entity Data Models 


NOTE: LUNCH IS INCLUDED WITH FULL DAY WORKSHOPS ፥ THE COST OF A WORKSHOP IS IN ADDITION TO THE REGULAR CONFERENCE FEE 
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using LINQ to Entities and Entity SQL with Object Services and stream data with Entity Client. The 
session will also explore some of the more complex features of object services as well as offer 
guidance as to when and where you will want to use the Entity Data Model and which of its core 
querying methods is right for different scenarios. Throughout the workshop we'll look at some 
practical applications of Entity Framework. 


SHAREPOINT 


PRE-CONFERENCE WORKSHOP • 9AM-4PM 
SHAREPOINT SERVER 2007 DOCUMENT MANAGEMENT BEST PRACTICES 
BEN CURRY 


Document management is the process of applying creation, management, storage and other 
rules to how documents are created, persisted and expired within an organization. Document 
collaboration is merely the process of checking out, checking in, and versioning a document 
before it is published. Windows SharePoint Services gives you document collaboration where as 
SharePoint Server 2007 gives you document management. Records management encompasses 
all of that which is document management plus it applies to a broader set of content elements- 
not just documents. Any electronic record, such as a list item or log entry, can be managed as 
well in SharePoint Server 2007 if there is a need to do so. Managing these documents involves 
workflows, templates, expiration policies, and integration with the Microsoft Office suite. This 
workshop will cover the following: 
1. Creating and managing Web applications for document collaboration 

a. Content database planning and management 

b. Information architecture 

c. Site directory 
. Creating and managing document libraries from an administrator's perspective 
. Creating and managing large lists for performance using indexed columns and folders 
. Integration with third-party products and Microsoft Outlook 2007 
. An overview of using Workflows for business processes 
. Leveraging content types for document management 

a. Templates 

b. Expiration 

c. Metadata collection via site columns and document information panels 

d. Workflows 
T. Replacing file shares with SharePoint (or why not to) 
8. Configuring document repositories for search and findability 
9. Managing documents from multiple locations 
10. Creating and managing a records repository 


Tl. Understanding and using the Recycle Bin for item recovery 


ውጮ сл ኤ ሯጩ ~ 


PRE-CONFERENCE WORKSHOP • 9AM-4PM 

PLATFORM EXTENSION MODEL FOR SHAREPOINT PRODUCTS 
AND TECHNOLOGIES 

MICHAEL HERMAN 


The goal of the Platform Extension Model for SharePoint Products and Technologies is to help 
architects and project planners understand how best to map their solution requirements with 
the ITB (In The Box) features of the SharePoint platform to minimize the amount of custom 
coding and maximize the amount of solution development through configuration (solution 
composability). The SharePoint Feature Dependency Network is also introduced. 
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12:30 pm - 
2:15 pm 
2:15 pm 
3:45 pm 
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WORKSHOPS AND SPEAKERS 
ARE SUBJECT TO CHANGE. SEE 
WEB SITE FOR THE MOST CURRENT 


DESCRIPTION AND AVAILABILITY. 


SCHEDULE 
at a glance 


SUNDAY, NOVEMBER 9, 2008 


- 12:00pm Pre-Conference Registration ONLY 
- 4:00pm Рге-сопѓегепсе Workshops 
MONDAY, NOVEMBER 10, 2008 


5:00pm Conference Registration 
- 4:00pm Рге-сопѓегепсе Workshops 
8:30pm — Opening Keynote 
ESDAY, NOVEMBER 11, 2008 • MICROSOFT DAY 
5:00pm Conference Registration 
8:00am ^ Continental Breakfast 
9:00am Keynote 
- 10:30am Conference Sessions 
11:45 ат Conference Sessions 
1:30pm — Lunch 
- 2:30pm Conference Sessions 
3:45pm Conference Sessions 
- 7:00pm — Expo Hall Opens/Opening Reception 

WEDNESDAY, NOVEMBER 12, 2008 
5:00pm Conference Registration 
8:00am ^ Continental Breakfast 
- 9:15am ^ Conference Sessions 
1:15 am ^ Conference Sessions 
12:45 рт Conference Sessions 
2:15pm Lunch 

- 3:30pm Conference Sessions 
- 5:30pm Conference Sessions 

THURSDAY, NOVEMBER 13, 2008 
8:00am Continental Breakfast 
- 9:15am Conference Sessions 
- 10:45am Conference Sessions 
12:30 рт — Conference Sessions 
2:15pm Lunch 

Expo Hall Closes 
- 3:15pm Conference Sessions 
- 430pm — Closing Session 
FRIDAY, NOVEMBER 14, 2008 

- 4:00pm — Post-conference Workshops 


TUESDAY, NOVEMBER 11: 
“WOMEN IN TECHNOLOGY” 


LUNCHEON 
SEE WEB SITE FOR DETAILS. 


NOTE: LUNCH I$ INCLUDED WITH FULL DAY WORKSHOPS + THE COST OF A WORKSHOP IS IN ADDITION ТО THE REGULAR CONFERENCE FEE 
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|| POST-CONFERENCE WORKSHOPS 


NOVEMBER 14, 2008 


SQL SERVER 


ASP.NET 


POST-CONFERENCE WORKSHOP • 9AM-4PM 
DATADUDE TO THE MAX (BRING YOUR OWN LAPTOP) 
GERT DRAPERS 


The latest release of Visual Studio Team System Database Edition, not only 
enables support for SQL Server 2008 and all its new metadata, it includes a 
completely revamped build and deployment engine and new tools for increas- 
ing the quality of your database schemas. This workshop will guide you 
through all the steps from how to get set up, creating your initial enlistments, 
building deployment packages, how to create configuration files for your 
deployment packages, and how to test and validate your deployments. If you 
bring your own notebook you can follow along using the VPC provided. You 
can attend without a laptop but your experience will be significantly better 
with one! This is meant as an advanced workshop and will expect a reason- 
able laptop configuration in order to participate: 

= Virtual PC 2007-already installed 

= At least 1 GB of physical memory w/512 MB dedicated to the VPC environ- 

ment (2 GB is preferred w/1 GB dedicated to VPC) 
= 12 СВ of physical disk space (20+ GB is preferred) 
= DVD drive 


SHAREPOINT 


POST-CONFERENCE WORKSHOP • 9AM-4PM 

THE SHAREPOINT DEVELOPER, DESIGNER, AND POWER USER 
GAME SHOW 

DUSTIN MILLER 


During the SharePoint Developer, Designer, and Power User Game Show, you'll 
learn the right way and the wrong way to develop custom solutions, design 
master pages and themes, and customize your site with tools like SharePoint 
Designer. This post-conference workshop is designed to appeal to developers, 
Web designers, and even "power users" who want to know how best to take 
advantage of SharePoint as a platform for collaboration and development. 
While there will be some focused discussions involving topics like .NET cod- 
ing, master page and page layout design, and data view Web parts, the work- 
shop will include sample code and ideas for every attendee, and is designed 
to allow everyone to take away something useful and powerful for their own 
SharePoint projects, no matter what their role. While there won't be lab 
assignments during this session, written labs will be provided to each 
attendee via an online site exclusive to this post-conference workshop. Plus: 
It's a game show! Plan to have fun and maybe even win some prizes! 


POST-CONFERENCE WORKSHOP • 9AM-4PM 
PATTERNS OF ASP.NET AJAX APPLICATION ARCHITECTURE 
DINO ESPOSITO 


ASP.NET AJAX applications do not have much in common with classic ASP.NET 
applications based on the Web Forms model. Architecturally speaking, AJAX 
applications are based on a two-tier presentation model where the front-end 
runs entirely on the client and is written in JavaScript and the back-end is an 
ad-hoc service layer that exposes public HTTP endpoints. Any data exchange 
between the tiers is based on JSON packets. ASP.NET AJAX provides an excel- 
lent model for designing and building a service-based back-end using either 
ASMX Web services or WCF services. But it is not of much help as far as the 
presentation is concerned. Once the client has collected raw data as the result 
of a server operation, how can you craft some good user interfaces? The first 
problem to face is just the implementation of an HTML factory. This can be a 
piece of software that runs entirely browser side and is coded using 
JavaScript. At the very minimum, it requires helpers for data-binding and tem- 
plates. Alternatively, you can force some of your AJAX services to operate as 
HTML factories and return HTML snippets in front of raw data. Performance 
and design considerations apply to both solutions, as you can easily imagine. 
A full Web-based rich user interface, though, also requires care when it comes 
to time out or throttle calls that may put too much pressure on the server. Not 
to mention that users of an AJAX application are active all the time and can 
start unneeded and unwanted operations if HTML input elements are not prop- 
erly disabled. In the end, architecting a true AJAX solution is not a walk in the 
park. But thankfully a number of recognized patterns exist to provide guidance 
on the toughest decisions. 


VISUAL STUDIO 


POST-CONFERENCE WORKSHOP • 9AM-4PM 
IMPLEMENTING BEST PRACTICE DATA ARCHITECTURES 
WILLIAM R. VAUGHN 


Many companies are at the stage of morphing or merging small applications into 
large systems but fail to implement efficient, scalable, supportable designs. 
Other shops are transitioning from Access/JET, Visual Basic 6.0, or other older 
technologies to the МЕТ Framework but have been unable to get a foothold on 
the best practices and available architectural choices. This often means cus- 
tomers are unhappy with performance, data security, and responsiveness of the 
development team to requested changes. After code review we typically find 
problems that center on data architecture and implementation issues where 
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fairly fundamental concepts have been overlooked or misunderstood. We find 
developers that are unsure about how their DBMS engine works, that they've 
chosen the wrong engine or have pushed it well beyond its limits. We find teams 
and data architectures that fight over shared resources and confusing design 
choices as Microsoft constantly adds new solutions. 


This workshop is designed to help developers gain a solid footing on the 
foundations of data access architecture. We discuss several alternative 
approaches and where each is best suited. We discuss how SQL Server works 
and how to choose the "right" version for their design today and tomorrow. 
We discuss the Visual Studio and SQL Server tools including SQL Server 
Profiler, SQL Server Management Studio, and the Visual Studio code genera- 
tors that can make the job seem easier but might also be the source of per- 
formance or code maintenance issues. This workshop includes Bill's popular 
sessions on "Getting Connected" and "Managing CLR Executables" as well as 
sessions on the new RDL reporting technology. This content is updated to 
reflect Visual Studio SP1 Business Intelligence tools in anticipation of SQL 
Server 2008's RTM. 


This workshop is all about data. It's about data architectures, data validation, 
connecting to data engines, running efficient queries, managing the resultsets, 
building server-side executables, cursors, constraints, indexes and managing 
users, and rights and security. It's about choosing how and where to save, 


retrieve, and protect data. It's about how to build efficient forms-over data appli- 


cations whether they are Windows Forms, WPF, WCF, ASP.NET or whatever new 
paradigm Microsoft thinks up between now and when the conference starts-or 
years in the future. 


WORKSHOPS AND SPEAKERS ARE SUBJECT TO CHANGE. 
SEE WEB SITE FOR THE MOST CURRENT 
DESCRIPTION AND AVAILABILITY. 


POST-CONFERENCE WORKSHOP • 9AM-4PM 
SERVICE-ORIENTATION, WCF, AND YOU 
JUVAL LOWY 


Contrary to common wisdom, service-orientation is not just for high-end appli- 
cations. Every application should be service-oriented, and Windows 
Communication Framework (WCF) is the .NET runtime for developing, deploying, 
and consuming service-oriented applications. But what is service-orientation 
really about? What does it mean for mere developers? Is there substance 
behind the hype? In this comprehensive one-day workshop, Juval will first 
demystify service-orientation for you, and introduce the basic motivation for 
service-oriented applications and their operating principal and concepts. In 
that light, Juval will then describe what WCF is and how it is designed, and 
demonstrate its advantages over traditional .NET programming. You will see 
that WCF is more than just the next generation platform for building connected 
Systems. In many respects, WCF is the next development platform for Windows 
applications, providing system features that are presently crafted by hand on 
top of .NET and Windows. With WCF, every class automatically benefits from 
these system features, from security to transactions to tracing and logging and 
much more. To maximize the use of these off-the-shelf plumbing aspects you 
should push the service boundary down into your system, but taken to its ulti- 
mate conclusion-should every class be a WCF services? And what about per- 
formance? The workshop will next demonstrate the power and productivity of 
WCF, contrasting WCF used granularly on every class with classic .NET in terms 
of performance, throughput and scalability, and will substantiate the provoca- 
tive claim that every class can and should be a service. Don't miss on this 
unique opportunity to understand SOA and WCF from Juval Lowy who has been 
part of the strategic design effort for WCF from the beginning, and who offers a 
profound insight on the methodology, the technology, and its application. 
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GROUP DISCOUNT 
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Register individuals from one 
company at the same time and 
receive a group discount. 


(4th, Sth, 6th...) 


1-3 registrants $1,495 per person 
Additional registrants | $1,295 per person 
after the 3rd ($200 off each) 


Call 800-438-6720 to take advan- 


tage of group discount pricing. 


Notes & Policies: The Conference Producers reserve the right to cancel the conference by refunding the registration fee. Producers can substitute speakers and topics and cancel sessions without notice or 
obligation. Updates will be posted on our Web site at www.DevConnections.com. Tape recording, photography is not allowed at any session. Conference producers will be taking candid pictures of events and 
reserve the right to reproduce. By attending this conférence you agree to tiis policy. You may transfer this registration to a colleague. Please inform us if you have any special needs or dietary restrictions when 
you register. Microsoft ASP.NET and Visual Studio Connections attendees will receive a one-year subscription to MSDN Magazine. Current subscribers will have an additional year added to their subscription. This offer 
is available to U.S. registrants only. $22.50 of the funds will be allocated toward a subscription to MSDN Magazine (a $45.00 value). This is not an additional expense and subtraction from prices listed is not permissi- 
ble. The conference registration includes a 12-month print subscription to SQL Server Magazine. Current subscribers will have an additional 12-months added to their subscription. Subscriptions outside of the United 
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and escape. Enjoy its restaurants, entertainment 

and enormous beach-pool, as well as wireless internet 
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HOTEL ACCOMMODATIONS 

Mandalay Bay Resort and Casino, 3950 Las Vegas Blvd. 

South, Las Vegas, Nevada, is the conference site and host hotel. 
SPACE IS LIMITED so reserve your room early by calling the 
conference hotline at 800-899-5325 or 203-268-3204. 


AIRLINE 
Please call Pericas Travel at 203-562-6668 for airline reservations. 


CAR RENTAL 

Hertz is offering auto rental discounts to attendees. Call the Hertz 
Meeting Desk at 800-654-2240 for reservations and refer to code 
СМЕ 10RO036 (Hertz) under Connections Vegas to receive your 
attendee discount. 


ATTIRE 

The recommended dress for the conference is casual and 
comfortable. Please bring along a sweater or jacket, as 

the ballrooms can get cool with the hotel's air conditioning. 


SPONSORSHIP/EXHIBIT INFORMATION 

For sponsorship information, contact: 

Rod Dunlap 

480-917-3527 phone 

E-mail rod@devconnections.com 

See Web site for more details. www.DevConnections.com 


TAX DEDUCTION 

Your attendance to a DevConnections conference may be tax 
deductible. Visit www.irs.ustreas.gov. Look for topic 513 - Educational 
Expenses. You may be able to deduct the conference fee if you 
undertake to (1) maintain or improve skills required in your present job; 
(2) fulfill an employment condition mandated by your employer to keep 
your salary, status, or job. 


States and Canada will be served in digital. $6.25 of the funds will be allocated toward a subscription to SQL Server Magazine ( $49.95 value). 


Registration & Cancellation Policy: Registrations are not confirmed until payment is received. Cancellations before October 6th, 2008 must be received in writing and will be refunded minus a $100 processing 
fee. After October 6th, 2008, cancellations and no shows are liable for full registration, it can be transferred to the next DevConnections conference within 12 months or to another person. Microsoft, Microsoft .NET, 


Visual Basic МЕТ, C#, Microsoft SQL Server, MSDN, WinFX, and Windows are either trademarks or registered trademarks of Microsoft Corporation. All other trademarks are property of their owners. 
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Sample databases educate with realistic, 
useful illustrations of business scenarios 


ny programmer will tell you: There’s nothing 
А“ a sample bit of code to illustrate how 

something is done. Sample code helps jump- 
start the learning and development process. Learning 
from examples is a great way to absorb new concepts 
and, arguably, is the standard used in almost every IT 
training class. 

In the database world, sample databases, APIs, and 
code serve a similar purpose. Through its CodePlex 
website and other resources, Microsoft has made avail- 
able a wealth of documents, webcasts, virtual labs, code 
libraries, and database samples to help with your SQL 
Server education. SQL Server 2005 introduced a new 
family of sample databases, called AdventureWorks, and 
changed the process for installing sample databases. You 
can still install the classic pubs and Northwind sample 
databases with the more recent versions of SQL Server, 
although you need to download them from the Microsoft 
Download Center. (For details, see the sidebar “Where in 
the World Are Pubs and Northwind?” on page 28.) 


CodePlex 

Possibly Microsoft’s most useful resource is its host web- 
site for open-source projects, CodePlex (www.codeplex 
сот), which stores publically available code samples, 
as well as all of Microsoft’s sample databases. In this 
case, “open source” means the website has plenty of 
third-party contributions and wiki-like entries. It also 
means Microsoft doesn’t control the content, so contrib- 
uted code might not be formally tested and validated. 
Microsoft hosts CodePlex as a storage service for the 
developer community. Data is gathered into “projects” 
based on subject matter. The last I checked, there were 
5,289 projects listed in CodePlex—at least one for virtu- 
ally every Microsoft product. 

CodePlex includes a SQL Server Community page 
(www.codeplex.com/SglServerSamples) that’s chock- 
full of samples—database samples, end-to-end multi- 
technology samples, SQL Server engine samples, SOL 


SQL Server Magazine * www.sqlmag.com 


Server Integration Services samples, and SQL Server 
Reporting Services samples, as well as tools and utilities. 
A link to the latest SQL Server 2008 release candidate is 
at www.codeplex .com/MSFTDBProdSamples/Release/ 
ProjectReleases .aspx?ReleaseId—10901. 

The multi-tabbed SQL Server Databases and 


Samples Overview wiki (www.codeplex.com/SglServer 
Samples/Wiki/View.aspx?title=SQLServerDatabases 


andSamplesOverview) contains new and enhanced code 
for both SQL Server 2008 and SQL Server 2005. The 
home page explains how to remove previously installed 
versions of the AdventureWorks sample databases and 
how to properly re-install them, even in a multi-instance 
environment. The code listings are extensive, covering 
all that’s new in everything from ADO to XML. There’s 
also room for reader comments and feedback. 

CodePlex’s Analysis Services page (www.codeplex 
.com/MSFTASProdSamples) contains explanations and 
code for building a SQL Server Analysis Services (SSAS) 
database for business intelligence (BI) scenarios. The 
newest SOL Server release (SQL Server 2008 Release 
Candidate 0—RCO) is listed in the Current Release 
section of the Home tab. You can click АП Releases to 
expose the link to the SQL Server 2005 SP2a samples 
installation page. Like all pages on the CodePlex website, 
this page provides tabs for discussions, issue trackers, 
source code, stats, and who to contact on the Microsoft 
team. There's so much on CodePlex that I simply cannot 
begin to cover it all. My suggestion is to go to CodePlex 
and start digging. 


Sample Databases 

SQL Server 2005 ships with two sample databases: 
AdventureWorks and AdventureWorksDW. These 
samples illustrate concepts of transactional database 
design and operation and of data warehousing and BI 
design and operation, respectively. The sample databases 
are optional features at install time; if you don't install 
them with SQL Server, you can download and install 
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д ኒ SAMPLE DATABASES 


WHERE IN THE WORLD ARE 
PUBS AND NORTHWIND? 


If you're a long-time user of SQL Server and you've migrated to SQL Server 2005, 
you'd probably like to know where you can find the pubs database. Neither pubs 
nor Northwind is included by default when you install SQL Server 2005. 

Even though my default installation of SQL Server is 2005, when | want a 
small and simple test bed to work with, | find that pubs or Northwind is prefera- 
ble to using AdventureWorks. AdventureWorks is realistic: It has multiple schemas 
and much of the size and complexity of a live system. However, for some testing 
and certain kinds of demonstrations, a large, complicated database isn't always 
optimal, and in those cases | fall back on pubs and Northwind. 

The pubs database is bundled with Northwind on the Microsoft Download 
Center website (www.microsoft.com/downloads/details.aspx?Familyld=06616212- 
0356-46A0-8DA2-EEBC53A68034), which says that you can use these sample 
databases with SQL Server 2005, SQL Server 2000 SP3 or later, SQL Server 
2000 Desktop Engine (MSDE 2000) SP3 or later, and MSDE 2000 Release A. 

After downloading SQL2000SampleDb.msi, double-click the file to run the 
Setup Wizard, which will extract pubs, Northwind, and a readme file into a folder 
named C:\SQL Server 2000 Sample Databases. The readme file contains instruc- 
tions on how to run the installation scripts. The Wizard needs just 1OMB on the 
C drive, but you might want to configure the database files (.mdf, .ndf, and .Idf) 
to reside in your regular \MSSQL\DATA folder, because these files can grow quite 
large. By default, the top-level folder is C:\Program Files\Microsoft SQL Server, 
but you don’t have to adhere to this architecture. 

If you have SQL Server 2000 installed, you should be able to find these 
sample databases in \Program Files\Microsoft SQL ServenMSSQL Data. You can 
simply detach the pubs and/or Northwind sample databases from SQL Server 
2000 and reattach them to SQL Server 2005, or back up and restore them. 

If you still have the SQL Server 2000 installation disks, you can find pubs 
and Northwind in the x86\DATA folder. Copy the files, paste them into a destination 
folder on your hard drive, and attach them using SQL Server Management Studio. 

If you get a copy of pubs or Northwind from a source other than the Microsoft 
Download Center, check with your security administrator before installing either 
database, as they both contain a guest account that could be considered a breach 
of company security. If you obtain pubs or Northwind from the Download Center, 
the database won't have a guest account. 


them from CodePlex, as I explain later. Microsoft also 
makes available a “lite” version of AdventureWorks. 
And finally, a sample database called FoodMart from 
Microsoft's earlier days remains available. 

Adventure Works. AdventureWorks, the transactional 
database, supports OLTP for a fictitious bicycle manu- 
facturer. The AdventureWorks design maps to real- 
world enterprise scenarios, with schemas for Human 
Resources, Person, Production, Purchasing, and Sales. 
There are separate versions for SQL Server 2008 and 
SQL Server 2005; the SQL Server 2008 version doesn’t 
work with SQL Server 2005. 

AdventureWorksDW. AdventureWorksDW illus- 
trates by example how to design a data warehouse. A 
data warehouse is a repository for a company’s historical 
data. (For a high-level data-warehouse overview, see 
“Data Warehousing: The Foundation of BI,” July 2007, 
InstantDoc ID 96191.) The design techniques and table 
architecture of a data warehouse differ significantly 
from an OLTP database. At some point in a DBA’s 
career, his or her company will most likely embark on 
a data warehouse project, and if the DBA is lucky, he 
or she will be part of the process. AdventureWorksDW 


28 September 2008 


provides a leg up on how to design a data warehouse. It 
uses the sample AdventureWorks OLTP database as its 
data source, so you can see how transactional data can 
be transformed into a data warehouse. 

Adventure WorksLT. The “lite” version of Adventure- 
Works, AdventureWorksLT, is a small, denormalized 
version of the AdventureWorks database that focuses on 
a product sales scenario and is not included in the SQL 
Server installation package. According to Microsoft, 
AdventureWorksLT “is helpful for those who are new 
to relational database technology.” However, this “lite” 
database contains too many violations of normalization 
and integrity. Additionally, the only available version of 
AdventureWorksLT is case-sensitive. 

In my opinion as an educator, exposing entry-level 
DBAs to bad database design and touting it as accept- 
able isn’t a good idea. I’ve removed it from my hard drive 
and use pubs and Northwind instead when I need simple 
databases for testing or demonstrations. If you want to 
achieve simplicity in a test database, I recommend lim- 
iting the number of business functions supported by the 
database, but making sure the database design is correct. 
Should you decide to use AdventureWorksLT, you can 


find it at www.codeplex.com/MSFTDBProdSamples/ 


Wiki/View.aspx?title=AWLT Docs. 


FoodMart. Beyond CodePlex, you can find other 
sample database offerings. For example, the Project- 
Distributor website (projectdistributor.net/Releases/ 
Release.aspx?releaseId—331) lets you download a SSAS 
backup of the FoodMart sample database, which 
Microsoft used to distribute with SSAS. FoodMart is 
a Microsoft Access database that encompasses sales, 
employee relations, and inventory management. Like 
pubs, it's used for testing, training examples, and posting 
sample queries and newsgroup questions, but it's used 
for data warehousing and BI instead of for OLTP 
(transactional) databases. A Microsoft Access copy of 
the database (foodmart.mdb) is available on the Seneca 
College of Applied Arts & Technology Microsoft Win- 
dows shareware page at acs.senecac.on.ca/ftp/ms. 


Download and Install 
It's a good idea to install the AdventureWorks data- 
bases, along with SQL Server 2008/2005, on your test 
and development servers. With all the complexity and 
capability in the two latest versions of SQL Server, you'll 
need a controlled environment in which to learn and 
experiment. The AdventureWorks databases include 
a guest user account, so make sure that you arent 
violating company policy by installing the databases. 
You can always remove the guest user if it constitutes a 
violation of your IT and database security; check with 
your security administrator first. 

Some folks believe the AdventureWorks databases 
should be installed on production servers, but I dis- 
agree. А production server is meant to be used for, well, 
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production, and AdventureWorks is a sample database 
meant for testing and learning. Just as you wouldn't use 
your production system for development, neither should 
you use it for testing. 

Downloading and installing the sample databases 
is fairly straightforward. For instance, to download 
AdventureWorks for SQL Server 2005 SP2a, go to www. 
.codeplex.com/MSFTDBProdSamples/Release/Project 
Releases.aspx?Releaseld=4004, decide which version 
you prefer (e.g., case-sensitive, case-insensitive, 32- or 
64-bit), click the link, click I Agree when presented 
with Microsoft's EULA, and download the file to your 
hard disk. It's a Windows Installer (.msi) file, so just 
double-click the file, accept the license agreement, and 
follow the InstallShield Wizard's prompts. The installer 
copies the database files (.mdb and .ldb) to the folder 
you've designated. 

To use the database, you must attach it to an instance 
of SQL Server using SQL Server Management Studio 
(SSMS). Open SSMS and connect to your test server. 
Open the hierarchy so you can see the entry for data- 
bases, right-click Databases, then select Attach. Click 
Add in the Attach Databases dialog box and navigate to 
the folder in which the installer placed the downloaded 
files. You should see only the data file (i.e, Adventure- 
Works data). Click OK, and then click OK again. The 
AdventureWorks database now should be in SSMSS list 
of databases. 


AdventureWorks Cycles 
Storefront 2000 

A database without an end-user application is like a house 
without furniture: It shelters you from the elements, but 
sleeping on the floor gets to be uncomfortable. At the 
Microsoft Download Center (www.microsoft.com/ 
downloads/details.aspx?FamilyID=15D6473F-1441- 


4269-84EA-14B04A0DAE09) you'll find sample code 
that demonstrates how to build an e-commerce website 


for the fictitious AdventureWorks bicycle manufacturing 
company. In addition to illustrating how to access code 
from SQL Server and navigate hierarchical data using a 
tree structure, the code illustrates how to encrypt pass- 
words and other sensitive data. The code was written for 
SQL Server 2000 and Visual Studio .NET 7.0, but it can 
be modified for more recent product releases. 


AdventureWorks Cycles 
Business Scenarios 

OK, you've got the back-end sample databases and 
the front-end sample web applications, and now you're 
probably asking, “How do they work together?” The 
SQL Server 2005 Books Online (BOL), September 2007 
edition, defines the major tables in the AdventureWorks 
OLTP database and describes how to use them, com- 
plete with T-SQL queries to generate information. The 


BOL sales and marketing page is at msdn2.microsoft 
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.com/en-us/library/ms124824.aspx; the product page is 
at msdn2.microsoft.com/en-us/library/ms124670.aspx; 
manufacturing is at msdn2.microsoft.com/en-us/library/ 


ms124499.aspx; and purchasing and vendors is at 


msdn2.microsoft.com/en-us/library/ms124785.aspx. 
If youre familiar with the Northwind database 


and are curious about the differences, go to the Adven- 
tureWorks to Northwind Table Comparison at msdn2 
.microsoft.com/en-us/library/ms124680.aspx. Are you 
a pubs lover? Then take a look at the AdventureWorks 
to pubs Table Comparison at msdn2.microsoft.com/ 
en-us/library/ms124583.aspx for information about how 
those two databases compare. At a high level, the biggest 
difference between AdventureWorks and Northwind 
or pubs is that AdventureWorks takes advantage of 
schemas and incorporates them into its design. 


Schemas 

SQL Server 2005 introduced the concept of a schema 
as separate from a user. Database objects such as tables, 
views, and procedures now belong to a schema rather 
than to a user. In earlier versions, removing a user's 
account from the database compromised access to objects 
owned by that user. Now, since a schema owns the 
objects, access to them remains unchanged regardless 
of what happens to user accounts. AdventureWorks 
contains five schemas: Human Resources, Person, 
Production, Purchasing, and Sales. You can read about 
them and how to access them on the Schemas in Adven- 


tureWorks page (msdn2.microsoft.com/en-us/library/ 


ms124894.aspx). 


If you'd like a visual guide to help you grasp the 
database schema, check out the schema diagrams 
page at www.codeplex.com/MSFTDBProdSamples/ 


Wiki/View.aspx?titl=AWSchemaDiag&referringTitle= 
Home. This page provides diagrams for Adventure- 


Works, AdventureWorksDW, and AdventureWorksLT. 


Online Libraries 

When you need a technical library at your fingertips, 
check out the SQL Server section of the TechNet Library 
at technet.microsoft.com/en-us/library/bb545450.aspx. 
Itoffers BOL, white papers, links to SQL Server Express 
Edition, the SQL Server 2005 Virtual Labs, and the Data 
Platform Developer Center for SQL Server 2008, 2005, 
2000, and 7.0. Tabs on the TechNet website take you to a 
list of learning resources, downloads, and support of all 
kinds, from the Knowledge Base to community forums 
and blogs, webcasts, wikis, and newsgroups. There's so 
much information available for the SQL Server plat- 
forms that it's hard to cover it all. 

If you're offline, you can always use your local copy 
of BOL, which, even though it's an optional component, 
should have been installed as part of client services. 
Open BOL 2008, 2005, or 2000, and type in samples to 
produce a list of locally available samples for everything 
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from sample databases and T-SQL code to utility pro- 
grams and procedural code. 


Resource Kits 

Resource kits are another valuable addition to the DBA's 
library. The typical resource kit is filled with chapter 
after chapter of how-to material and real-world lessons, 
plus utilities, explanations, and code to help you do your 
job better. The SQL Server 2000 Resource Kit, which 
at one time was available as a separate purchase, is out 
of print, but you can still find it at Amazon.com, often 
with the accompanying CD. A free text version of the kit 
is available at www.microsoft.com/resources/documen 
tation/sql/2000/all/reskit/en-us/default.mspx. 

Starting with SQL Server 2005, Microsoft has appar- 
ently changed the resource kit model. Previously presented 
as a bound collection of information, the SQL Server 2005 
resource kit provides distributed information and learning 
opportunities, anchored by CodePlex and the MSDN 
Library (msdn2.microsoft.com/en-us/sqlserver/default 
.aspx). I found a few obscure references to a “SQL Server 
2005 Resource Kit” in online literature, but the resource kit 
itself seems to have vanished. I found a book in print titled 
Microsoft SQL Server 2005 Administrator's Companion 
(Eric Whalen et al., Microsoft Press, 2006) that seems to 
fit the model of a resource kit, and it might be the closest 
volume on the market. But there are both print and online 
books concerning every facet of SQL Server 2005. 


If you're a Microsoft Dynamics NAV 5.0 customer, 
you can get a CD that’s essentially a SQL Server 2005 
Technical Kit for this environment. Dynamics NAV is an 
end-to-end, customizable package that provides a frame- 
work of business units (e.g., financials, manufacturing, 
sales, marketing) on which you can hang your own 
custom solutions. According to a Waldo’s Blog entry 
(dynamicsuser.net/blogs/waldo/archive/2007/12/13/sql- 
server-technical-kit-for-microsoft-dynamics-nav.aspx), 
this CD contains everything that the rest of us would 
want in a resource kit. However, if you aren't a registered 
NAV partner or customer, you're not eligible to receive 
this disk. Pity. 


Where to Start 
When you're working with a new product or building 
a system, there's nothing like learning by example. 
And there's so much information available on the four 
current versions of SQL Server that it's hard to decide 
where to start! I suggest you take stock of what you're 
doing with SQL Server, assess where you are on your 
personal learning curve, and then seek out documents 
and learning materials that will support and advance 
your progress. And, of course, don't forget to down- 
load and install the sample databases—pubs, North- 
wind, and AdventureWorks—so you'll have something 
to practice on. SOL} 
InstantDoc ID 99635 


| 


ix 
us: 


на | 


То 


Reasons tó A 


Windows, IT ዮተ ር) мес 


Windows IT Pro is the leading independent voice in Windows IT, 


reaching 2.5 million engaged users each month. 


68% of readers spend an hour or more reading each issue. 


49% of Windows IT Pro subscribers visit windowsitpro.com on a monthly basis. 


On average, each issue of Windows IT Pro is passed 


94% of our readers are involved with purchasing, with 74% specifying or 


recommending brands and suppliers. 


Readex Reader Survey, September 2007 


5 


avercise with дз 


work 


along to one other colleague. 


30 September 2008 


SQL Server Magazine • www.sqlmag.com 


FEATURE 


T-SQL 10 |: 


VIEWS 


In Lesson 7, 
learn how and when to use views 


until this point, I’ve covered how to 

manipulate data stored in physical 

tables. Now 11 introduce you to que- 
rying data represented by virtual tables. These virtual 
tables are SQL constructs known as views. 

Unlike conventional physical tables, views don’t 
contain data. A view is merely a stored SQL statement 
that, when executed, generates a result set. When you 
query a view, youre really querying the result set of 
a previously defined query. By the end of this lesson, 
you'll have a firm grasp of views and how to create 
them. You'll also understand the advantages and dis- 
advantages of using them. 


The Prerequisites 

To run the examples in this lesson, make sure your 

MyDB database contains the following tables: 

* The Employee table created in Lesson 3 

* The Movie and Genre tables created in Lesson 5 

* The revised MovieReview table created in Lesson 6 
(and not the original MovieReview table created in 
Lesson 4) 

* The MovieReviewSummary table created in Lesson 6 


If you haven't created these tables, you'll find the code 
in the 99765.zip file. To download this file, go to www 
.sqlmag.com, enter 99765 in the InstantDoc ID text 
box, and click the 99765.zip hotlink. 


How to Create Views 
To create a view, you first need to define a SELECT 
statement. The SELECT statement doesn't have be a 
simple query of one table. It can be as complex as you 
need it to be. For example, you can join multiple tables 
or use functions to create computed columns. You can 
even include other views in the SELECT statement. 
However, there are a few items you can't include in 
a SELECT statement used to define a view. As noted in 
the SQL Server 2005 Books Online ( BOL) documenta- 
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tion at msdn2.microsoft.com/en-us/library/ms187956 
„aspx, you can't include 

* COMPUTE or COMPUTE BY clauses 

* ORDER BY clauses unless there's also a TOP clause 
* INTO clauses 

* OPTION clauses 

* References to temporary tables or table variables 


After you have a SELECT statement defined, you 
need to use the CREATE VIEW command. The basic 
syntax is 


CREATE VIEW DatabaseName. SchemaName. ViewName 

C 
ColumnNamel, 
ColumnName2, 
ColumnName3[,...n] 

) 

AS 

--Your SELECT statement 


In the DatabaseName. SchemaName. View- 

Name segment, DatabaseName is the name of the 
database in which the new view will reside and Schema- 
Name is the schema 
to which the table will 
belong. (If you're unfa- 
miliar with schemas, see 
Lesson 6.) The Database- 
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Name and SchemaName 
arguments are optional. 
If you don't include the 
DatabaseName argu- 
ment, the view will be 
created in thecurrent data- 
base. If you don’t include 
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the SchemaName argument, the table will be owned by 
the dbo schema, which is the default schema of the data- 
base owner (DBO). The examples that follow assume 
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LISTING 1: Code to Create the 
vMovieReviewSummary View 


USE MyDB 


-- Delete the vMovieReviewSummary view if it exists. 


T-SQL 101 


the current database 
is MyDB and уоште 
the DBO, so they 
don't include the 


IF (OBJECT ID('vMovieReviewSummary') IS NOT NULL) 


DROP VIEW vMovieReviewSummary 


GO 


-- Create the view. 


CREATE VIEW vMovieReviewSummary 


AS 
SELECT m.Movie, g.Genre, 


LEFTCREPLICATE('* ',AVG(mr.Stars)),10) AS 'Stars', 


COUNT(*) AS 'Reviews' 
FROM MovieReview mr 


INNER JOIN Movie m ON m.MovieID 
INNER JOIN Genre g ON g.GenreID 


GROUP BY m.Movie, g.Genre 
GO 


Chariots of Firewire 
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Figure | 
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optional Database- 
Name and Schema- 
Name arguments. 

You use the View- 
Name argument to 
specify the name of 
the view you want to 
create. View names 
can be as long as 128 
characters and must 
follow the rules of 
identifiers. (If youre unfamiliar with these rules, see 
the SQL Server 2005 BOL documentation at msdn2 
-microsoft.com/en-us/library/ms175874.aspx.) The 
ViewName argument is required. 

Inside the parentheses is where you can list the 
names of the view’s columns. Alternatively, you can 
specify the names within the SELECT statement, as 
the following examples show. Like column names in 
tables, column names in views are limited to 128 char- 
acters, must be unique, and must follow the identifier 
rules. 

Let's create a view with the SELECT statement that 
was used to create the MovieReviewSummary table 
in Lesson 6. То begin, execute Listing 1% code in the 
query window in SQL Server 2005's SQL Server Man- 
agement Studio (SSMS) or SQL Server 2000's Query 
Analyzer. In Listing 1, notice that I broke the code into 


mr.MovieID 
m.GenreID 


Reviews 


Fiction 
Drama 

Drama 
Sci-Fi 
Documentary 
Horror 
Horror 
Comedy 
Documentary 
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two sections called batches. A batch is a series of one 
or more related T-SQL statements that are submitted 
to SQL Server for execution. SQL Server executes 
the commands in each batch sequentially from top to 
bottom. Each batch is separated by the GO keyword. 
In this case, the first batch causes the vMovieReview- 
Summary view to be dropped if it already exists in the 
current database. The second batch contains the com- 
mand that creates the view. 

Note that there are multiple ways to determine 
whether a view or any other SOL object exists in a 
database. I prefer using the OBJECT ID function 


because it's short and to the point. The OBJECT ID 
function returns the object's ID value from the sysob- 
ject table if the object exists. If the object doesn't exist, 
OBJECT ID returns a null value. When the value isn't 
null, SQL Server uses the DROP VIEW command to 
delete the existing view. 

After you run the code in Listing 1, you can 
examine the view by running the code 


SELECT * 
FROM vMovi eReviewSummary 
ORDER BY Stars DESC 


Figure 1 shows the results. 


The Advantages 

Using views offers several advantages. А key benefit 
is that views let you use only the data you're inter- 
ested in. In addition, views can help reduce code 
redundancy and complexity. (Views can also restrict 
access to subsets of data in a table and isolate appli- 
cations from base-table structural changes, but these 
advanced uses for views go beyond the scope of this 
lesson.) 

To demonstrate the benefits that views can pro- 
vide, imagine the following scenario: You work at a 
ficticious company in which employees review movies. 
Your boss has asked you to produce two reports. The 
first report needs to show the least-popular movies 
reviewed so far, those movies' genres and ratings, and 
the names of the employees who reviewed the movies. 
The second report needs to show the top three movie 
reviewers and how many movies they 
reviewed. 

When looking at the Movie- 
Review, Movie, Genre, and Employee 
tables, you become a bit overwhelmed 
because of all the extra data available. 
You're interested only in the employee 
names and the movies’ names, genres, 
and ratings, yet the tables include hire 
dates, salary information, and other 
data. Furthermore, both reports need 
to access the same set of tables, so you 
don't want to write the same code twice. 

Thankfully, as Listing 2 shows, you can create a 
view that contains only the information you need, all in 
one spot. The vMovieReview view contains three inner 
joins; the MovieReview table is joined to the Movie, 
Genre, and Employee tables. The view contains only 
those columns needed, so the view doesn't contain any 
extraneous information. 

With the vMovieReview view, writing the reports 
is a piece of cake. To produce the first report 
that shows the least-popular movies, you write the 


query 
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SELECT Movie,Genre, LISTING 2: Code to Create the 
Stars, EmployeeName vMovieReview View 
FROM vMovieReview 
à USE MyDB 
WHERE Rating - 1 IF (OBJECT ID('vMovieReview') IS NOT NULL) 
ORDER BY Movie, DROP VIEW vMovieReview 
EmployeeName 0 
CREATE VIEW vMovieReview 
To produce the second report that shows the three top Аз Р 
s К К SELECT m.Movie, g.Genre, 
movie reviewers, you write the query LEFT(REPLICATE(‘* ',mr.Stars),10) AS 'Stars', 


mr.Stars AS 'Rating', 
e.FirstName + ' * + e.LastName AS 'EmployeeName' 


SELECT TOP 3 FROM MovieReview mr 


EmployeeName, INNER JOIN Movie m ON m.MovieID - mr.MovieID 
COUNT(*) AS 'Reviews' INNER JOIN Genre g ON g.GenreID - m.GenreID 
. . INNER JOIN Employee e ON e.EmployeeID = 
FROM vMovieRevi ew mr.EmployeeID 
Group BY EmployeeName 60 


ORDER Ву 2 DESC 


Executing these two queries pro- 


duces the reports in Figure 2 and i Emp loyeeName 
Figure 3, respectively. Bridge over 06 River Motherboard Drama 4 Raoule Teteblanche 
Now let’s compare the tw z Defragger Hi Documentary Garret Testerson 
Я 9 И USES ра ፍ 1g o que Defragger Hill Documentary Raoule Teteblanche 
ries you just ran against the vMovie- Defragger Hill Documentary * William McEvoy 
Revi : ith th Hes that Meet the Clusters Comedy Bill Diamond 
eview view with the queries tha Meet the Clusters Comedy e Garret Testerson 
would need to run against the Meet the Clusters Comedy * Garth Vader 
. Planet Of The Ape-Like DBA's Horror ad 8111 Diamond 
physical tables. To produce the first = of e Ape-Like a Horror * Garth Vader 
; ; The User Who Knew Too Muc Horror * Garret Testerson 
report using the physical tables, the V for Vendor Sci-Fi Garret Testerson 


Garret Testerson 
william McEvoy 


ጁ ጁ OF 


query would be When Harry Re-Indexed Sally's Table Documentary 
When Harry Re-Indexed Sally's Table Documentary 


SELECT m.Movie, g.Genre, 


Figure 2 
LEFT(REPLICATE('* ',mr. INNER JOIN Employee e 
Stars) ,10) ON e.EmployeeID ፣ ሻሻ. 
ploy movies 
AS 'Stars', = mr.EmployeeID 
mr.Stars AS ‘Rating’, GROUP BY LEFT(e.FirstName + 
LEFT(e.FirstName + ' * + ' C + e, LastName, 18) 
e.LastName , 18) ORDER BY 2 DESC 
AS 'EmployeeName' 
FROM MovieReview mr The two queries that ran against the view are clearly 
INNER JOIN Movie m shorter and less complicated than the queries that 
ON m.MovieID - mr.MovieID you'd need to run against the physical tables. With 
INNER JOIN Genre g less code to write, the chance of 
ON g.GenreID - m.GenreID making errors such as typos is 
INNER JOIN Employee e reduced. EmployeeName Reviews 
ON e.EmployeeID - mr.EmployeeID 
WERE Stars = 1 The E 
ORDER BY Movie, Disadvantages william McEvoy 10 
EmployeeName As stated previously, a view is 
a virtual table. A view's virtual Figure 3 
To produce the second report using the physical tables, nature is its greatest strength The top three movie 
the query would be and its greatest weakness. Because the contents of reviewers 
a view are determined at runtime, every time you 
SELECT TOP 3 reference a view, the SELECT statement that was 
LEFT(e.FirstName + ' ' + used to define the view must be executed because 
e.LastName , 18) the view exists only in temporary storage. (Note that 
AS 'EmployeeName', there is one exception: indexed views. See the sidebar 
COUNT (*) "Indexed Views vs. Regular Views," page 34, for more 
FROM MovieReview mr information.) 
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Indexed Views vs. Regular Views 


SQL Server 2005 and SQL Server 2000 support indexed views, which are 
also known as materialized views. When a unique clustered index is added 
to a view, the resulting indexed view materializes and persists the data in 
physical storage. Unlike regular views, which are virtual and take up only 
temporary storage space at runtime, indexed views are permanent and 
require permanent disk storage. You can read more about indexed views 
in the SQL Server 2005 Books Online (BOL) documentation at www 
-microsoft.com/technet/prodtechnol/sql/2005/impprfiv.mspx. 
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Excel/Word/PDF Exports 

Take reports with you in any of the 
supported formats: send them out, 
print them out, or just analyze the 
numbers. 


Dashboards 
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to create dashboard, use the controls 
anywhere you need to examine 
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www.datadynamics.com 


Tel: (614 895-3142 


Consider the following sce- 
nario: Some developers at a 
company create a view that per- 
forms complex transformations 
on data derived from multiple 
tables. This view is the source 
for an online historical sales 
report. Hundreds of regional 
managers run this report at 
least once a day, complaining 
all the while about the sluggish- 
ness of the system. 

Each time this report is executed, the view 
is executed. The view, in turn, reads the original 
data, transforms it, then stores the transformed 
data in temporary storage. So, the original data 
is being transformed hundreds of times each day 
even though that data hasn't changed. Although 
the developers saved themselves some coding time 
by using a view, they greatly increased the amount 
of work the server must perform daily. 

In this case, the developers ignored the virtual 
nature of the view and treated it as if it were a 
physical table. It would have made more sense from 
a performance perspective to use the view to gen- 
erate the report data once and store the results in a 
physical table. Subsequent report executions could 
then query the physical table, not the view. 

You might find it hard to conceive that a view 
could actually hurt a server’s performance, but it 
happens all too often. For example, a few years 
ago I was troubleshooting a fairly standard per- 
formance problem. After an upgrade, users were 
complaining that a certain application was pain- 
fully slow. I tracked down the problem to a view 
used to populate the application’s main grid. This 
innocent-looking view didn’t merely join a few 
tables together—it joined multiple views together. 
Each view, in turn, joined to other views, and so 
on. When all was said and done, there were more 
than 100 tables being joined together. No wonder 
the grid was slow! 

Just because SQL Server lets you join views to 
other views, it doesn’t mean that you should. Be 
careful with joins and always examine the contents 
of a view before incorporating it into your code. 
Test the view under various scenarios to make sure 
it performs as expected. 


Use Views Wisely 

Views are incredibly useful constructs that can 

either make or break a project. Used wisely, views 

can reduce code redundancy, reduce errors, and 

make your job easier. Used unwisely, views can 

result in performance problems. ፳፲ 
InstantDoc ID 99765. 
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Master SharePoint 
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B rTt m WindowsITPro 
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WHAT YOU'LL LEARN 


Creating Custom Field Types and Controls for SharePoint Sites 
Leverage SharePoint's out-of-the-box field types with custom field con- 
trols, and see how to create a custom field type with a custom control 
and value type as well as adding a specialized design time rendering in 
tools such as SharePoint Designer. 


Creating & Debugging Custom Timer Jobs 

in Windows SharePoint Services 

Clean up old sites, send user-created alerts, and disk-quota warnings 
with custom timer jobs, plus learn about the various deployment and 
installation options. 


Leveraging the Content Query Weh Part to Its Full Potential 

Go farther with the content query Web Part (СОМР) іп MOSS 2007 
by configuring the COWP query and additional filtering options and 
customize the styling/rendering of the result set. 


INSTRUCTION LED BY 
Andrew Connell is a Microsoft MVP and an independent 
consultant with a background in content management 
solutions, Web development, and Microsoft's SharePoint 
technologies. 
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_DAY 48: This is ridiculous. We’re sitting on a giant 
mountain of information, but we can’t use it in any way 
that drives the business forward. We need info we can 
trust. We need info we can act on. 


_We need to get Gil an oxygen tank. 


_DAY 50: I'm taking back control with IBM. Their Information 
On Demand portfolio can help us optimize our business 

by unlocking the real value of our information. Now we have 
everything we need, including software, services and 
storage, for an end-to-end business solution. We can turn 
trusted business information into actionable insight. 


_Did I use the word “actionable”? Yes, yes, I did. I am Ned. 
Ned who aligns business with IT and uses the word “actionable. 


Watch the Information On Demand video at: 


tion Management IBM.COM/TAKEBACKCONTROL/INFO 
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DEVCONNECTIONS WORKSHOPS 
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WEB DEVELOPMENT FOR THE NEW WORLD: 
Power Workshop Power Workshop Power Worksho 
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From the producers of DevConnections comes the most powerful 

; үе ап ; ; ; REAL-WORLD SHAREPOINT DEVELOPMENT: 
series of in depth workshops beginning this Fall in San Francisco. A 212-DAY WORKSHOP 
Attendance is limited. Sign up today! TODD BAGINSKI 


REGISTER TODAY! • www.DevConnections.com/SFworkshops 


Questions & Answers 


Get help from the pros 


Q: I use a disconnected architecture in which sales agents 
take their systems to the field and add new customers. 
How can I create unique primary keys that are easy for 
customers to use? We have a few thousand customers, so an 
integer (rather than a GUID) would be best and easiest for 
the agents. However, I want to leave a printed invoice with 
customers that shows their real customer numbers. 
А: I suggest using the “sales book” technique for this 
architecture. That is, the salesperson is given a preprinted 
set of invoices that include the unique primary key 
value. Back in the office these values can be scanned 
into the system. Because no two sales books have the 
same sequence, you won't have any collisions. You could 
accomplish the same result with a paperless approach 
that preassigns a block of rows to a specific salesperson. 
When the data is entered, the application uses one of 
the prepopulated data rows so there are no INSERT 
operations, just UPDATES. Collision avoidance is 
easy. Because the salesperson “owns” the row, no other 
applications can change the data until populated on the 
server. This occurs when the salesperson gets back in 
touch with the home office and posts updates. 
— William Vaughn 
InstantDoc ID 99385 


Q: I have a report that's hosted on SQL Server 2005 
Reporting Services (SSRS). This report is stored locally 
on field agents’ laptops in a SQL Server Compact Edi- 
tion database that is synchronized with the home-office 
database when the agents have connectivity. How can the 
field agents print the report while on the road? 
А: Visual Studio 20085 Microsoft ReportViewer can 
extract a report from any datasource (e.g., a flat file, a 
Microsoft Excel spreadsheet, a full-blown SQL Server 
database management system—DBMS); any datasource 
with an ODBC, OLE DB, or .NET Data Provider; or 
any object with an [BindingList interface. ReportViewer 
uses a local report processor to parse the report defini- 
tion file just like SSRS, but it doesn’t handle the data 
fetching operations; it assumes you've already done that. 
This means you can add your SSRS Report Definition 
Language (RDL) report to a Visual Studio 2005 project 
(not Visual Studio 2008, until SQL Server 2008 ships) 
and reset the datasources. Then your code must open 
the connection, run the query, produce an ADO.NET 
DataTable (or any IBindingList-enabled structure), and 
point the LocalReport class to it. For more information, 
see my book “Hitchhiker’s Guide to Visual Studio and 
SQL Server" (hitchhikerguides.net/default.aspx). 
— William Vaughn 
InstantDoc ID 993861 
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Q: How do I retrieve values from a table and store the 
results in a string? 

A: The following code provides a simple yet effective 
method for retrieving values from a table and storing 
the results in a string, without having to rely on cursors, 
WHILE loops, or other complex structures: 


declare (string varchar(8000) 
select @string = '' 


select @string = @string + MyColumn 
from MyTable 


print @string 


You need to initialize your string to a non- NULL 
value, or the string will end up as NULL, which is the 
default behavior of NULLs and string concatenation. 
— Bill McEvoy 
InstantDoc ID 99384. 


Q: Sometimes I need to manage data on a client in a 
SQL Server Compact Edition database and keep the 
data current through replication. I don't want to incur 
the expense of using SQL Server Workgroup Edition 
as the replication publisher. What's the best way to keep 
my data in sync? 
A: Microsoft is working on this problem. The 
goal is to bring SQL Server Compact Edition 
into more desktop applications and let you syn- 
chronize data more easily. Microsoft has released 
the new ADO.NET 3.5 Synchronization. Ser- 
vices classes, which let you connect two database 
systems and keep the data current. The ADO 
.NET Synchronization Services classes are smarter 
than remote data access (RDA) and have less over- 
head and less need for an expert DBA than merge 
replication does. Sync Services is a service-based 
API that is coded not unlike ADO.NET and that 
supports heterogeneous databases (1.6., the databases 
don’t have to be the same type or architecture), 
incremental change tracking, conflict detection and 
resolution, and many more features. What Sync 
Services doesn’t do (yet) is support schema changes 
or use on devices. For more information about ADO 
„МЕТ 3.5 Synchronization Services, see SQL Server 2005 
Books Online. For more information about SQL Server 
Compact Edition, see my eBook “Hitchhiker’s Guide 
to Microsoft SQL Server 2005 Compact Edition" (www. 
.hitchhikerguides.net/EBooks/default.aspx). ፳፲ 
— William Vaughn 
InstantDoc ID 99387 
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William Vaughn 
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expert on Visual Studio, SQL Server, Reporting 
Services, and data access interfaces. He is 
also the author of the critically acclaimed 
“Hitchhikers Guide” series. 


September 2008 39 


On the Scent to 


Your 
New 


Job. 
" IT Job Hound 


www.itjobhound.com 


The Leader of the Pack in IT Job Search Web Sites! 


IT Job Hound is the best place to find a job in IT. 
Whether you're a developer, IT Pro, web designer, or 
somewhere in between—we'll help you: 


• Get your anonymous resume in front of employers. Sign up and 


• Set up E-mail Job Alerts that e-mail you the latest jobs that post አር 
you'd be interested in. resume now: 


> Be one of the first 50 to sign 


up and post your resume today 
and you'll receive 10 free song down- 
loads! It’s like getting an entire CD for 


• Find more than your next rung on the ladder—we'll 
help you find a job you love at a company that fits your 
needs, lifestyle, and salary requirements like a glove. 


What are you waiting for? Find your new IT job now at 


FREE! Sign up here: www.windowsitpro. 


www.itjobhound. com com/go/10freesongs 


IT Job Hound is brought to you by the passionate and well-connected people at Windows IT Pro and SQL Server Magazines. 


Employers: Post your jobs on IT Job Hound and reach the most 


qualified candidates in the IT industry. Set up your account and 
post your jobs today at www.windowsitpro.com/go/jobhound 


Database Encryption 


Solutions 


Protect your databases—and your 
company—from attacks or leaks 


G iven recent well-publicized data leaks and beefed- 
up security regulations that require companies 
to publicly disclose when unencrypted data has been 
exposed, all companies not using database encryption 
should be asking themselves why not. At the simplest 
level, database encryption addresses the concern that 
an attacker might get through your network’s other 
defenses or that the attacker might be someone from 
inside the organization. Encrypting data helps keep 
your company’s name from appearing in the headlines 
next to disturbing phrases such as “security breach.” 

However, database encryption isn’t a one-size-fits- 
all solution. Your organization needs to evaluate what 
data needs protection, where the encryption should 
be performed (e.g., in the database, in the application 
where the data lies, above the database), and how much 
performance it’s willing to trade for increased security. 
You might have no choice in these factors if you must 
encrypt data to comply with particular regulatory 
standards such as The Payment Card Industry Data 
Security Standards or the Health Insurance Portability 
and Accountability Act. But how? And with what? The 
product table on page 42 compares features from the 
vendors who participated in our research, if you'd like 
to jump there now. 


Why Not Go Native? 

You could use the built-in encryption of Microsoft SQL 
Server 2005, but that often involves rewriting database 
and front-end code to make encryption work. In SQL 
Server 2008, the Extensible Key Management (EKM) 
feature introduces the option of using EKM-compatible 
third-party products along with builtin SQL Server 
encryption. And SQL Server 2008 will offer transparent 
database encryption (TDE), which is full database-level 
encryption that protects not only columns and rows but 
the data and log files, too. However, not everyone will 
move to SQL Server 2008 right away. 

Additionally, some organizations prefer to use 
non-Microsoft solutions, or they require that encryp- 
tion takes place outside of SQL Server. Also, because 
the best practice for implementing security solutions 
involves a layered approach, an encryption solution 
from a third party on top of SQL Server can provide 
the in-depth defense your database needs. 
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Choosing a Level of Encryption 
After you decide to encrypt your database, you have a 
choice as to how much of it to encrypt. Database encryp- 
tion products generally operate at one of two levels: 

* Column-level encryption works by encrypting 
individual columns in a database. For example, pro- 
tecting stored credit card numbers can be done with 
column-level encryption. 

* File-level encryption, also called whole-database 
encryption, works at the OS level, just above the 
file system, securing underlying files that store data- 
base data. Encryption and decryption are done in 
file-system blocks as the database process reads or 
writes to database files. Because encryption occurs 
independently of the database, modifications to 
applications, database schema, and the network 
aren't necessary to implement such solutions. 


Additionally, some vendors offer encryption at the 
table level and the application level. Vendors wallpaper 
the Internet with white papers arguing for and against 
their various approaches. Your research will determine 
which type works best for your organization. 


Deal Breakers and Deal Makers 
To make the best choice of an encryption solution, you'll 
need to consider who you want to protect data from and 
for whom you are protecting it. Let's look at some of the 
factors that can make or break a given solution. 

Compliance requirements. Compliance needs in your 
industry might drive your selection of an encryption 
solution. For example, PCI standards require encryp- 
tion of data, but other aspects of its requirements, such 
as level of encryption, are open for interpretation. Some 
regulatory groups might require centralized storage of 
keys rather than distributed key storage. 

Hardware versus software. Hardware solution pro- 
viders tout scalability, but some software products also 
offer good scalability. 

Split-key ability. Split-key solutions let two DBAs 
access parts of a key but not the whole key, preventing 
either one from having full access to the encryption 
of the database. If you already have access control, or 
monitoring and auditing of DBA activities, you might 
not require a split-key feature. 


Caroline 
Marwitz 


cmarwitz @ windowsitpro.com) is an 


associate editor for Windows IT Pro 
and SQL Server Magazine, specializing in 
Active Directory, Group Policy, and desktop 


management. 
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Company 


ActiveCrypt 
Software 
www.xpcrypt.com 


NetLib 
203-321-1278 
877-367-1177 
www.netlib.com 


Protegrity 
203-326-7200 


www. protegrity.com 


SafeNet 
650-261-2400 
866-464-7426 
www.safenet.com 


Voltage Security 
650-543-1280 


www.voltage.com 


Vormetric 
888-267-3732 
www.vormetric.com 


DATABASE ENCRYPTION SOLUTIONS 


Key storage. Another consideration with keys is 
where they’re managed and stored. Hardware-based 
solutions store encryption keys on an appliance, 
offering a centralized location. Software-based encryp- 
tion products store encryption keys on the servers 
where the encrypted data is located—a distributed 
approach to key storage—although some software 
vendors offer an optional appliance for key storage. 
Centralized storage offers an easy target for attack, but 
distributed storage is only as good as the most recent 
security patch on your server. 


Ciphers used. Certain algorithms or ciphers are 
stronger or faster to process than others, so performance 
trade-offs might be involved. For example, the DES 
algorithm is considered less secure than other ciphers, 
but it’s faster than the Triple DES (3DES) algorithm; the 
RCS cipher is considered a high-performing algorithm 
that’s also secure. Key size and how the product uses the 
algorithm also affect performance and security. 

Performance. The current accepted wisdom 15 that 
column-level encryption causes a greater performance 
hit than whole-database encryption because column- 


Product Price and Licensing | SQL Server Versions | OSs Supported Type of Encryption Hardware or Support for 
Supported software? SQL Server 2005 
Cell-Level 
Encryption? 
XP_CRYPT 4.4 $399 for single- SQL Server 2008, Windows 055 Column level Software No 
server license; 2005, 2000, and 7.0 
$3,498 for company 
license (all servers) 
NetLib Encryp- From $5,500; SQL Server 2008, Windows NT 4.0 Available іп column- Software; optional Yes, with 
tionizer for SQL licensed per produc- 2005, 2000, and 7.0 апа later level encryption ver- hardware-based key ^ database-level 
Server tion server (developer version (developer sion, database-level ^ management encryption 
also supports MSDE version also encryption version, version 
2000) supports and combined 
Windows 98 version 
and 95) 
Defiance Data Contact vendor SQL Server 2005 and Windows 2003, Column level Software No 
Protection 2000 Windows XP, and 
System (DPS) Windows 2000 
SafeNet Data- Contact vendor SQL Server 2005 Windows 055 Column level, file Both No 
Secure i116 level 
Voltage Secure- _ From $35,000; All SQL Server ver- Windows (all Uses format- Software; can also Yes 
Data licensed by modules sions (and Oracle, versions) Linux, ^ preserving encryption Бе purchased as 
needed and applica- 082, Sybase) Linux for zSeries, ^ to store data in a preconfigured 
tion use AIX, and Solaris its original format appliance; optional 
whether at column Hardware Security 
level or application Module (HSM) for 
level root key storage 
Vormetric Data From $50,000; All SQL Server ver- Windows 2003, File level and table Both Yes (with 
Security Expert licensed per CPU sions XP, and Win2K level, including Vormetric 
configuration files Key Security 
and logs Expert) 
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DATABASE ENCRYPTION SOLUTIONS 


level encryption works within SQL Server rather than 
between SQL Server and the file system. Know exactly 
what to expect and how your organization is prepared 
to deal with any trade-offs between performance and 
security. 


Customer Service, Cost, 

and Complexity 

Finally, as with any solution, you'll want to consider 
customer service and cost. Does the solution provider 
offer multiple ways for you to contact its support 


people? And do they respond to problems efficiently 
during your evaluation period? Along with the actual 
cost of the product, consider the time required to 
implement it. Will intensive training be necessary? 
Will administration be easy to integrate with other 
IT duties? Implementing any encryption solution for 
your databases is sure to add some complexity to 
management—but if you find a product that meshes 
well with your management style, you're well on your 
way to more secure data. SQL} 

InstantDoc ID 99744. 


Key Lengths 
Used 


Where Are Keys Stored? Additional Features 


Split Key 
Ability? 


If Server-Based, 
Works on Clus- 


SQL Server 2008 | SQL Server Used 


Transparent Data | 2008 EKM? 
Encryption? 


tered Servers? 


No No 3DES, RSA, RC4, Data 128, 192, 256 Tables, file system, registry Мо Yes 
Encryption Standard bits for AES; 
X (DESX), Digital 256 to 8192 
Signature Algorithm bits for RSA 
(DSA), Secure Hash and DSA 
Algorithm-1 (SHA-1), 
SHA-256, SHA-512, 
MD5 
Yes Yes AES, Blowfish, 3DES 64, 128, and Flexible key storage Yes Yes 
256 including encrypted key file 
on hard drive; encrypted 
file on remote drive; on 
device such as USB key; 
in encrypted SQL column; 
additional options available 
through API 
Contact vendor Contact vendor  3DES, AES-128, 128, 256 Keys are stored centrally in Үеѕ Yes 
AES-256, and other a security policy repository 
industry standards 
No No Advanced Encryption 512, 1024, All cryptographic pro- Yes МА 
Standard (AES), 3DES, 2048 asym- cessing, key management, 
DES, RSA, metric key logging, auditing, and 
RC4, SHA-1, Hash Mes- sizes; 40, 56, security policies on a 
sage Authentication 128, 168, 192, single hardened 
Code (HMAC)-SHA-1 256 symmetric — appliance 
key sizes 
Yes Yes Format-Preserving Configurable, ^ Keys generated on demand; Yes Yes 
Encryption, AES-128, default is Stateless Key Management 
AES-256, 3DES, 256-bit AES means no storage needed; 
Identity-Based Encryp- key Root Keys can be option- 
tion, RSA ally stored in a FIPS Rated 
Hardware Security Module. 
Yes (with No AES, DES, 3DES 128, 256 Keys are centrally stored in Yes Yes 
Vormetric Key the hardened appliance; 
Security Expert) also provides the option to 
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securely cache keys locally 


Can be redistributed in a 
software solution; silent instal- 
lation; uses OpenSSL so there 
are no export limitations; data 
can be decrypted outside of 
SQL Server using ActiveX; also 
available for X64 and 1464 
platforms 


Limited access control and 
logging; remote installation in 
developer version; only product 
to offer both column encryption 
and whole-database encryp- 
tion options in one combined 
version 


Access control, logging, remote 
installation, and broad cross- 
platform support including 
open systems, AS400, 2/05, 
Oracle, and more 


Can be integrated at the web 
server, application server, 
database, or file system layer; 
it also offers access control, 
logging, alerting, backup and 
restore, and disaster recovery 


Role-based access to manage- 
ment console; no scale limits; 
can also be used to create 
masked versions of production 
data to separate production 
and test environments from 
data breaches 


Granular access control and 
authentication; logging offered 
for both data access and key 
operations; strong support for 
distributed environments 
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DTS xCHANGE 


Pros: Easy-to-use, wizard-driven DTS 


DTS xChange 


This tool provides a real shortcut 
for your DTS migration package 


P Works’ DTS xChange converts DTS 
packages to native SQL Server Integration Services 
(SSIS) packages for use with SQL Server 2008 and SQL 
Server 2005. DTS xChange takes a three-phase approach 
to DTS-conversion projects: profiling, migration, and 
monitoring. DTS xChange Profiler analyzes existing 
DTS packages, reporting the tasks that comprise the 
packages. For project planning, the profiler lets you enter 
the estimated time that staff will need to complete the 
conversion of each task type, and produces project totals. 
The DTS xChange Migration tool converts the 
packages. Once the conversion is complete, you 
can use the Business Intelligence Development 


ages; profiling took less than a minute. As Figure 1 
shows, DTS xChange Profiler’s report summarized 
the occurrence of each type of DTS task and data 
provider found in the packages, then estimated each 
package’s conversion time and cost. I also opened and 
saved several of the packages on a remote SQL Server 
2000 system. DTS xChange Profiler worked with that 
system as well as it did with local packages. 

The package-migration tool has capabilities that 
SQL Server 2005% DTS Package Migration Wizard 


Studio (BIDS) to manage and modify the 
new SSIS packages. DTS xChange 15 available 
in two editions: one for unlimited use and 
one that performs unlimited migrations of 
50 unique DTS packages. You can also pur- 
chase DTS xChange Profiler as a standalone 


product, and the DTS xChange free trial runs 
with only the Profiler enabled. 


Installation runs quickly on any 32-bit 
Windows Vista, Windows Server 2003, 
or Windows XP system with SQL Server 
20055 SSIS 
components and 
Microsoft .NET 


Figure | 
The DTS xChange Package Profile Report 


migration tool offers more functionality 
than Microsoft’s DTS Migration Wizard; you can 
add support for SSIS features (e.g, Checkpoints); 
outputs standard, modifiable SSIS packages 


Cons: Very little documentation available; falls 
short of 100 percent conversion 


Rating: ewe ом 


Price: $4,995 for unlimited edition; $895 
for 50-package migration limit; $495 for DTS 
xChange Profiler only 


Recommendation: You'll still have some 

work to do after you migrate packages, but DTS 
xChange will save you a lot of time. You’d be crazy 
to undertake a DTS migration project without at 
least giving this product a try. 


Contact: Pragmatic Works • www.dtsxchange 
.com • 617-532-0342 
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Framework 2.0 installed, 
but documentation is 
sparse. A Quick Start 
Guide in PDF format 
is the only documenta- 
tion that accompanied 
the product, and the link 
to online help led to an 
"under construction" 
web page. However, the 
Quick Start Guide got me 
started. 

For my testing, I down- 
loaded and used a set of 
DTS packages supplied 
by Pragmatic Works. I 
ran DTS xChange Pro- 
filer against the test pack- 


lacks, including the ability to migrate Dynamic Prop- 
erties tasks. The migration tool also handles most 
flat-file scenarios and supports migration of ODBC 
connections. 


A Work In Progress 
DTS xChange is a work in progress; in fact, Pragmatic 
Works released several enhancements during the 
writing of this review. As it stands, the product has 
some weaknesses. Documentation is sparse, and in my 
tests, not every DTS package migrated completely. 
Still, DTS xChange does a much better job than the 
Microsoft DTS Migration Wizard, and your SSIS pack- 
ages require little effort to complete. If you want a tool 
that eliminates many hours of labor and lets you take 
advantage of new SSIS facilities, DTS xChange is for 
you. ፳፲ 
InstantDoc 10 98998, 
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Bytes from the Blog 
www.sgimag.com/go/industrybytes 


Microsoft to Purchase Zoomix 


М icrosoft announced that it has inked an agreement 
to purchase Zoomix, a Jerusalem-based company 
that creates software to automate the delivery of correct, 
complete, and synchronized enterprise data within the 
business workflow. When the acquisition is complete, the 
Zoomix development team will join the Microsoft R&D 
group in Israel to work on SQL Server solutions. 
Zoomix Accelerator is Zoomix's only product. 
According to the vendor, Zoomix Accelerator is “а 
scalable, high-performance, automated data processing 


**Zoomix will contribute a 
critical layer of data improve- 
ment to Microsoft's SQL 
Server data platform, [which] 
will enable organizations 

to meet the requirements 

of complex information 
systems and streamline 
business processes." 


Drowning in Data 


[ a recent Business Performance Management 
Roos Forum survey sponsored by network 
storage provider BlueArc and titled “Uptime (ወ 
Crunch Time—Valuing the Need for Data Speed at 
Critical Business Inflections,” more than 80 percent 
of respondents said that storage performance and 
data access are important to their business. However, 
78 percent of respondents were concerned that they 
weren't prepared to address significant spikes in 
demand for data. 

I spoke with Jon Affeld, BlueArc’s senior director of 
product marketing, about the study. Affeld said that the 
BPM Forum surveyed more than 125 IT professionals 
earlier this year to get their views on storage networking. 
According to the BPM Forum, the purpose of the study 
was to assess "the importance of data performance to 
business success and the impact of downtime during 
peak periods of massive information processing." 

The study showed that many businesses aren't pre- 
pared to handle the increasing data volumes and user 
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server that solves data inconsistencies in-line with busi- 
ness processes." Microsoft has acquired several com- 
panies (and entered into a few licensing agreements) 
in the business intelligence (BI) space over the past 
12 months, and the Zoomix acquisition underscores 
Microsoft's attempts to bolster its BI portfolio. 

Moshe Lichtman, vice president of Microsoft 
International and president of R&D in Israel, notes 
that “Microsoft’s additional investment in Israel 
takes our R&D center into the field of solutions for 
databases, a field that 1s growing rapidly and 15 very 
strategic for Microsoft International. Zoomix will 
contribute a critical layer of data improvement to 
Microsoft's SQL Server data platform. This capability 
will enable organizations to meet the requirements of 
complex information systems and to streamline busi- 
ness processes, and is therefore expected to have a great 
impact on the organization's bottom line." 

To learn more about Zoomix, visit www.zoomix 
.com. 


—Sheila Molnar 
InstantDoc ID 99753 


demand over the next year. Key survey findings include 

* More than a third of respondents anticipated that 
their storage needs will grow by at least 50 percent 
in the next two years. 

* More than 25 percent of respondents said that 
poor data performance has negatively affected their 
organizations. 

* Fifty-five percent of respondents said that short- 
term glitches of even one day could be catastrophic 
to their business. 


According to Affeld, implementing the right stor- 
age infrastructure allows companies to reduce 
application wait times, handle more concurrent 
users, and provide those users with faster access to 
rich media. To learn more about the “Uptime @ 


Crunch Time" study, go to www.uptimeatcrunchtime 


„com. To find out more about the BPM Forum, visit 
www.bpmforum.org. ЕП 
—Megan Веапу 

InstantDoc ID 99543 
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9 Editor's Tip 


Өз». 


Got а great 
new product? 
Send announce- 
ments to products@ 


sqlmag.com 


—Jeff James, 
senior editor 


PERFORMANCE 
MONITORING 

Optimize SQL Server Performance 
SQL Sentry has announced the availability of its new 
performance monitoring and optimization tool, SQL 
Sentry Performance Advisor. Performance Advisor fea- 
tures include a performance dashboard, real-time and 
historical analysis of performance metrics, graphical 
monitoring of disk activity, blocking and deadlock 
analysis, and automated tracing. According to the com- 
pany, these features “simplify the process of optimizing 
SQL Server performance.” Licenses are available for $1,495 per SQL Server instance plus an annual maintenance 
fee of $299. For more information, contact SQL Sentry at sales@sqlsentry.net or go to www.sqlsentry.net. 


DATABASE PROFILING 

Cost-Effective Database Profiling 

AMB Dataminers has announced AMB-InfoPublish, with support for IBM DataStage/Information Server extrac- 
tion, transformation, and loading (ETL) tools. The company says that AMB-InfoPublish provides a cost-effective 
way for administrators to profile entire SQL Server databases. The product also profiles Oracle, Teradata, flat-file, 
and DB2 (UDB, DB/400, zOS) databases. AMB-InfoPublish support integrates with Microsoft SQL Server Inte- 
gration Services (SSIS) and IBM DataStage. According to Steven Meister, president of AMB, “AMB-InfoPublish 
profiling ... allows users to independently profile new sources of information and drop it into DataStage projects 
when needed. ETL developers will have profiling information right in each stage.” For more information, contact 
AMB at 800-928-0365 or visit www.payasyougodataquality.com. 


DATABASE SECURITY 
Patch Databases Rapidly 


— M Hedgehog vPatch, Sentrigos new virtual patching software, protects 


databases against vulnerabilities as soon as they're discovered. Hedgehog 
vPatch fortifies corporate databases against recently discovered security 
problems without incurring database downtime or affecting related appli- 
cations. Sentrigo typically patches newly discovered vulnerabilities within 
days and automatically delivers updates to Hedgehog vPatch. Companies 
can configure Hedgehog vPatch to prevent intrusions by terminating or 
quarantining sessions or to generate alerts. The product currently supports 
Oracle and Microsoft databases. Pricing begins at $750 per CPU for an 
annual subscription. For more information, contact Sentrigo at 781-935- 


2879 or visit www.sentrigo.com. 
CHANGE MONITORING AND 
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CAPACITY PLANNING 

Manage Growth and Change in Your SQL Server Environment 

Quest Software has released the latest version of two of its SQL Server management products, Capacity Manager 

for SQL Server 2.5 and Change Director for SQL Server 2.0. Capacity Manager helps you manage the growth 

of your SQL Server environment by providing capacity planning for all your SQL Server databases and related 

applications, such as SharePoint. Capacity Manager also includes intelligent index maintenance and partition 

management. With Change Director, you can automate the deployment and migration of database schema 

changes. In addition, Change Director now offers middle-tier processing, which lets you offload schema com- 

parisons, script generation, and job scheduling to a persistent processing tier for better performance. For more 

information, contact Quest Software at 800-306-9329 or visit www.quest.com. SQL] 
InstantDoc ID 99710_ 
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T-SQL Enhancements 
in SQL Server 2008, Part 2 


S Q tant T-SQL enhancements, 1 ran out 


of space last month describing them (see “T-SQL 
Enhancements in SQL Server 2008,” InstantDoc 
ID 99416). Now let's look at more T-SQL enhance- 
ments you might find useful. 


0x4D69636861656C, ፀን AS ‘Style 0' 
SELECT CONVERT (char(8) , 
0x4D69636861656C, 1) AS ‘Style 1’ 
SELECT CONVERT (char (8) , 
0x4D69636861656C, 2) AS ‘Style 2’ 


Michael Otey 


motey@ sqlmag.com) is technical director 
for Windows IT Pro and SQL Server Magazine 


and author of Microsoft SQL Server 2008 New 
Features (Osborne/McGraw-Hill). GROUPING SETS Operator 

The new GROUPING SETS operator lets you specify 

multiple groupings of data (or grouping sets) in a single 

query and gain 

more control over 


Table-Valued Parameters 
፲፻ you've ever needed to pass a large number of 
parameters to a stored procedure or func- 
tion, you'll appreciate SQL Server 2008's 
support for table-valued parameters. 

As you can see in Listing 1, I created 


LISTING |: Table- 
Valued Parameters 


CREATE TYPE MyTableType AS 

TABLE (ID int, Item магсћаг(20)) 
GO 

CREATE Procedure usp MyProc 
(@MyTableIn MyTableType READONLY) 
AS 


LISTING 2: Operator 
the aggregations for GROUPING SETS 


the query returns. Use AdventureWorks 
a new Table Type called MyTableType. 


In Listing 2,1 © 
Next, I created a stored procedure that 


SELECT Name, DATEPART(yy,DueDate) 
used a SELECT 
SELECT * FROM GMyTableIn uses a table as a parameter. Last, you 


As Year, SUM(TotalDue) As Total 
с 5 statement to return FROM Sales.SalesOrderHeader 0 


JOIN Sales.Store AS S ON 


DECLARE @MyTableIn MyTableType; can see the new MyTableType variable three columns from S.CustomerID = O.CustomerID 
INSERT INTO GMyTableIn VALUES GROUP BY GROUPING SETS 
(1, 'Bike") that was populated and passed to the the SalesSalesOr- : 
Бродар дет usp_MyProc stored procedure. derHeader table (Name, DATEPART(yy,DueDate)), 
d the Sales.Store Ви 
i ап. | 
Enhanced CONVERT Function ) D 


table, aggregating 

the total for the 

TotalDue column. The GROUPING SETS clause creates 

three grouping sets: results by Name and DueDate, results 

by Name, and the entire result set. ፳፲ 
InstantDoc ID 99778 


The CONVERT function now supports multiple 
conversion styles, giving you more control over the 
output of the function. I’ve used the new styles in the 
following example: 


SELECT CONVERT (char(8) , 


ዳኤ. 
S you just can't say it better than InstantDoc ID 96975 

somebody else already has. In a September "SSRS and MOSS 
2007 Letter to the Editor (InstantDoc ID 96635), Ins 
Roger Reid wrote “а SQL Server admin supporting “Е 
SharePoint needs to know a fair t about 
дет " To learn more about Microsoft Office 
oint Server (MOSS) and Windows SharePoint 
—— (WSS), check out these resources. 


፡ Deploying the Power,” 


ling Information-Management Policies,” 
ос ID 97156 
ion Integration: SSRS and MOSS 2007,” 


MI 
MI 


Christan 


For in-depth SharePoint workshops: Humphries 


For basic information about SharePoint: 

“Windows SharePoint Services 3.0 Out of the Box,” 
InstantDoc [р.94240  - 

“What Can | Accomplish with Other SharePoint 
Technologies?” InstantDoc ID 96841 


On September 30th and October 151, SharePoint 
MVPs Dan Holme, Michael Noel, and Andrew Con- 
nell share their real-world perspective, experience, 
and expertise. You'll learn how to build a better 
SharePoint infrastructure, develop more effective 


(christan.humphries @ penton.com) is a 


Windows IT Pro associate and regular 
contributor to Windows IT Pro, SQL 
Server Magazine, and associated websites, 
specializing in the Windows IT Pro net- 


work of tools and resources. 


SharePoint applications, and enable more powerful 
collaboration. To register, go to www.windowsitpro 
.com/go/elearning/gotSharePoint. 


For information about how MOSS and SQL Server 
work together: 
“Migrating Reporting Services to MOSS,” 


SQL Server Magazine, September 2008.Vol. 10, No. 9 (ISSN 1522-2187). SQL Server Magazine is published monthly by Penton Media, Inc., copyright 2008, all rights reserved. SQL Server is a regis- 
tered trademark of Microsoft Corporation, and SQL Server Magazine is used by Penton Media, Inc., under license from owner. SQL Server Magazine is an independent publication not affiliated with 
Microsoft Corporation. Microsoft Corporation is not responsible in any way for the editorial policy or other contents of the publication. SQL Server Magazine, 221 E. 29th St., Loveland, CO 80538, 
800-621-1544 or 970-663-4700. Sales and marketing offices: 221 E. 29th St., Loveland, CO 80538. Advertising rates furnished upon request. Periodicals Class postage paid at Loveland, Colorado, and 
additional mailing offices. Postmaster: Send address changes to SQL Server Magazine, PO. Box 447, Loveland, CO 80539-0447. Subscribers: Send all inquiries, payments, and address changes to 
SQL Server Magazine, Circulation Department, P.O. Box 447, Loveland, CO 80539-0447. Subscriptions in US, $54.95 for one year; in Canada, US $59, plus 6% for GST for one year; in UK £59 for 
one year; in all other countries, US $99 for one year. Single-copy price in US, $5.95; in Canada, US $7.95. Printed in the U.S.A. BPA Worldwide Member. 


48 September 2008 SQL Server Magazine * www.sqlmag.com 


Achieve Ultimate Microsoft® 
SQL Server® Performance 
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Index Defragmentation 


m+ eel 2 %- беп SOL вета manger (RepostorePMBS 91: - (Fragmercrancn trpio 


SOL defrag manager Version 2.0: 
FIND FRAGMENTED INDEXES AND FIX THEM 


With the new "hands-free" automation in SOL defrag manager v2.0, you'll not only know exactly 
where fragmentation is, you'll also be able to defrag every index, table, database, or SOL Server 


automatically – even while you sleep! 


=> Defrag now! Free for 14 days! 


Download: www.idera.com/defra 


(Сега 


© 2008 BBS Technologies, Inc. All rights reserved. Idera and SOL admin toolset are trademarks 
or registered trademarks of BBS Technologies, Inc. All other brand or product names are 
trademarks or registered trademarks of their respective holders. 


